SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



CD 

m 

CO 
H 

I 



(i) APPLICANT: MAERTENS, GEERT 

BGSMAN, FONS J> 

DE MARTYNOFF, GUY QJ 

BUYSE, MARIE-ANGE C 

m 

(ii) TITLE OF INVENTION: PURIFIED HEPATITIS C VIRUS ENVELOPE Q 
PROTEINS FOR DIAGNOSTIC AND THERAPEUTIC USE Q 

(iii) NUMBER OF SEQUENCES: 111 •< 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: NIXON & VANDERHYE P.C. 

(B) STREET: 1100 NORTH GLEBE ROAD 

(C) CITY: ARLINGTON 

(D) STATE: VIRGINIA 

(E) COUNTRY: U.S.A. 

(F) ZIP: 22201-4714 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 
(3) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 (EPO) 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/612,973 

(B) FILING DATE: ll-MAR-1996 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: BYRNE, THOMAS £. 

(B) REGISTRATION NUMBER: 32,205 

(C) REFERENCE/ DOCKET NUMBER: 1487-10 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (703) 816-4000 

(B) TELEFAX: (703) 816-4100 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 






Tyr 


His 


Val 
35 


Thr 


Asn 


Asp 


Cys 


Ser 
40 


Asn 


Ser 


Ser 


He 


Val 
45 


Tyr Glu 


Ala 






GCG 
Ala 


GAC 
Asp 
50 


ATG 
Met 


ATC 
He 


ATG 
Met 


CAC 
His 


ACC 
Thr 
55 


CCC 
Pro 


GGG TGC 
Gly Cys 


GTG 
Val 


CCC 
Pro 
60 


TGC 
Cys 


GTT 
Val 


CGG 
Arg 


GAG 
Glu 


192 




AAC 
Asn 
65 


AAC 
Asn 


TCT 
Ser 


TCC 
Ser 


CGC 
Arg 


TGC 
Cys 
70 


TGG 
Trp 


GTA 
Val 


GCG 
Ala 


CTC 
Leu 


ACC 
Thr 
75 


CCC 
Pro 


ACG 
Thr 


CTC 
Leu 


GCA 
Ala 


GCT 
Ala 
80 


240 




AGG 
Arg 


AAC 
Asn 


GCC 
Ala 


AGC 
Ser 


GTC 
Val 
85 


CCC 
Pro 


ACC 
Thr 


ACG 
Thr 


ACA 
Thr 


ATA 
He 
90 


CGA CGC 
Arg Arg 


CAC 
His 


GTC 
Val 


GAT 
Asp 
95 


TTG 
Leu 


288 




CTC 
Leu 


GTT 
Val 


GGG 
Gly 


GCG 
Ala 
100 


GCT 
Ala 


GCT 
Ala 


CTC 
Leu 


TGT 
Cys 


TCC 
Ser 
105 


GCT 
Ala 


ATG 
Met 


TAC 
Tyr 


GTG 
Val 


GGG GAT 
Gly Asp 
110 


CTC 
Leu 


336 




TGC 
Cys 


GGA 
Gly 


TCT 
Ser 

115 


GTC 
Val 


TTC 
Phe 


CTC 
Leu 


GTC 
Val 


TCC 
Ser 
120 


CAG 
Gin 


CTG 
Leu 


TTC 
Phe 


ACC 
Thr 


ATC 
He 
125 


TCG 
Ser 


CCT 
Pro 


CGC 
Arg 


384 


Ui 


Arg 


CAT 
His 
130 


GAG 
Glu 


ACG 
Thr 


GTG 
Val 


CAG 
Gin 


GAC 
Aso 
135 


TGC 
Cys 


AAT 
Asn 


TGC 
Cys 


TCA 
Ser 


ATC 
He 
140 


TAT 
Tyr 


CCC 
Pro 


GGC 
Gly 


CAC 
His 


432 


Vas'' 


ATA 
He 
145 


ACA 
Thr 


GGT 
Gly 


CAC 
His 


CGT 
Arg 


ATG 
Met 
150 


GCT 
Ala 


TGG 
Trp 


GAT 
Asp 


ATG 
Met 


ATG 
Met 
155 


ATG 
Met 


AAC 
Asn 


TGG 
Trp 


TCG 
Ser 


CCT 
Pro 
160 


480 


rfj 


ACA 
Thr 


ACG 
Thr 


GCC 
Ala 


CTG 
Leu 


GTG 
Val 
165 


GTA 
Val 


TCG 
Ser 


CAG 
Gin 


CTG 
Leu 


CTC 
Leu 
170 


CGG 
Arg 


ATC 
He 


CCA 
Pro 


CAA 
Gin 


GCT 
Ala 
175 


GTC 
Vai 


528 




GTG 
Vai 


GAC 
Asp 


ATG 
Met 


GTG 
Val 
180 


GCG 
Ala 


GGG 
Gly 


GCC 
Ala 


CAT 
His 


TGG 
Tro 
185 


GGA 
Gly 


GTC 
Val 


CTG 
Leu 


GCG 
Ala 


GGC 
Gly 
190 


CTC 
Leu 


GCC 
Ala 


576 




TAC 
Tyr 


TAT 
Tyr 


TCC 
Ser 
195 


ATG 
Met 


GTG 
Val 


GGG 
Gly 


AAC 
Asn 


TGG 
Trp 
200 


GCT 
Ala 


AAG 
Lys 


GTT 
Val 


TTG 
Leu 


ATT 
He 
205 


GTG 
Val 


ATG 
Met 


CTA 
Leu 


624 




CTC 
Leu 


TTT 
Phe 


GCT 
Ala 


CTC 
Leu 


TAATAG 






















642 



210 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 212 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu Leu Ser Cys 
15 10 15 





Leu Thr He Pro Ala Ser Ala Tyr Glu Val Arg Asn Val Ser Gly Met 
20- 25 30 

Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val Tyr Glu Ala 
35 40 45 

Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys Val Arg Glu 
50 55 60 

Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala 
65 70 75 80 

Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His Val Asp Leu 
85 90 95 

Leu Val Gly Ala Ala Ala Leu Cys Ser Ala Met Tyr Val Gly Asp Leu 
q 100 105 110 

f£ys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr lie Ser Pro Arg 
Jt; 115 120 125 

^Arg His Glu Thr Val Gin Asp Cys Asn Cys Ser He Tyr Pro Gly His 
L! 130 135 140 

[Tile Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp Ser Pro 
"145 150 155 160 

Hrh*- Thr Ala Leu Val Val Ser Gin Leu Leu Arg He Pro Gin Ala Val 
SI 165 170 175 

U Val Asp Met Val Ala Gly Ala His Trp Gly Val Leu Ala Gly Leu Ala 
g 180 185 190 

^Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val Leu He Val Met Leu 
195 200 205 



Leu Phe Ala Leu 
210 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 795 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE : NO 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..792 

(ix) FEATURE: 



HCVENV.-T 



. (A) NAME /KEY : mat_peptide 
(B) LOCATION: 1..789 

■ ■■■■>■ --^m 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ATG TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC GGC TTC GCC GAC CTC 48 
Met Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
15 10 15 

GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 96 
Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA 14 4 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

%CA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG 192 

5rhr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
^ 50 55 60 

y^TG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG 240 

Uteu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Vai Arg Asn Vai 

^ 65 70 75 30 

'"^CC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 288 
Lser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 
U 85 90 95 

BT AT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 336 

ffTyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
%[ 100 105 110 

H^TT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 384 

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 

115 120 125 

CTC GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC 4 32 

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His 
130 135 140 

GTC GAT TTG CTC GTT GGG GCG GCT GCT TTC TGT TCC GCT ATG TAC GTG- 480 
Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val 
145 150 155 160 

GGG GAC CTC TGC GGA TCT GTC TTC CTC GTC TCC CAG CTG TTC ACC ATC 528 
Gly Asn Leu Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr He 
165 170 175 

TCG CCT CGC CGG CAT GAG ACG GTG CAG GAC TGC AAT TGC TCA ATC TAT 57 6 

Ser Pro Arg Arg His Glu Thr Val Gin Asp Cys Asn Cys Ser He Tyr 
180 185 190 

CCC GGC CAC ATA ACG GGT CAC CGT ATG GCT TGG GAT ATG ATG ATG AAC 624 
Pro Gly His He Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn 
195 200 205 

TGG TCG CCT ACA ACG GCC CTG GTG GTA TCG CAG CTG CTC CGG ATC CCA 672 
Trp Ser Pro Thr Thr Ala Leu Val Val Ser Gin Leu Leu Arg He Pro 
210 215 220 



HCVENV.-T 




CAA GCT GTC GTG GAC ATG GTG GCG GGG GCC CAT TGG GGA GTC CTG GCG 720 
Gin Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val Leu Ala 
225 230 235 240 

GGT CTC GCC TAC TAT TCC ATG GTG GGG AAC TGG GCT AAG GTT TTG ATT 768 
Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val Leu He 
245 250 255 

GTG ATG CTA CTC TTT GCT CCC TAATAG "795 

Val Met Leu Leu Phe Ala Pro 
260 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
„, (A) LENGTH: 263 amino acids 

y (B) TYPE: amino acid 

y=i (D) TOPOLOGY: linear 

LJui 

yg (ii) MOLECULE TYPE: protein 

H (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Hdet Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
Ul l 5 10 15 

5 

O/al Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

tjua Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

Mrhr Gly Asn Leu Pro Gly Cys Ser Phe Ser lie Phe Leu Leu Ala Leu 
50 55 60 

Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 30 

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 
85 90 95 

Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His 
130 135 140 

Val Asn Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val 
145 * 150 155 160 

Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr He 
165 170 175 

Ser Pro Arg Arg His Glu Thr Val Gin Asp Cys Asn Cys Ser He Tyr 
180 185 190 



HCVENV.-T 



Pro Gly His He Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn 
195 200 205 

Trp Ser Pro Thr Thr Ala Leu Val Val Ser Gin Leu Leu Arg He Pro 
210 215 220 

Gin Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val Leu Ala 
225 230 235 240 

Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val Leu He 
245 250 255 

Val Met Leu Leu Phe Ala Pro 
260 



(2) INFORMATION FOR SEQ ID NO: 7: 



"f! (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 633 base pairs 
^ (B) TYPE: nucleic acid 

%P (C) STRANDEDNESS : single 

*p (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..630 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

■ (3) LOCATION: 1..627 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

ATG TTG GGT AAG GTC ATC GAT ACC CTT ACG TGC GGC TTC GCC GAC CTC 4 8 

Met Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
15 10 15 

ATG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGT GCT GCC AGA 96 
Met Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAA GAC GGC GTG AAC TAT GCA 14 4 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

ACA GGG AAT TTG CCT GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTA 192 
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 

CTG TCC TGT CTG ACC ATT CCA GCT TCC GCT TAT GAG GTG CGC AAC GTG 240 
Leu Ser Cys Leu Thr He Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 



HCVENV.-T 



TCC 
Ser 


GGG 
Gly 


ATG 
Met 


TAG 
Tyr 


CAT 
His 
85 


GTC 
Val 


ACG 
Thr 


AAC 
Asn 


GAC 
Asp 


TGC 
Cys 
90 


TCC 
Ser 


AAC 
Asn 


TCA 
Ser 


AGC 
Ser 


ATT 
He 
95 


GTG 
Val 


288 


TAT 
Tyr 


GAG 
Glu 


GCA 
Ala 


GCG 
Ala 
100 


GAC 
Asp 


ATG 
Met 


ATC 
He 


ATG 
Met 


CAC 
His 
105 


ACC 
Thr 


CCC 
Pro 


GGG TGC 
Gly Cys 


GTG 
Val 
110 


CCC 
Pro 


TGC 
Cys 


336 


GTT 
Val 


CGG 
Arg 


GAG 
Glu 
115 


AAC 
Asn 


AAC 
Asn 


TCT 
Ser 


TCC 
Ser 


CGC 
Arg 
120 


TGC 
Cys 


TGG 
Trp 


GTA 
Val 


Ala 


Leu 
125 


ACC 
Thr 


CCC 
Pro 


ACG 
Thr 


384 


CTC 
Leu 


GCA 
Ala 
130 


GCT 
Ala 


AGG 
Arg 


AAC 
Asn 


GCC 
Ala 


AGC 
Ser 
135 


GTC 
Val 


CCC 
Pro 


ACT 
Thr 


ACG 
Thr 


AfA 

Thr 
140 


ATA 

He 


CGA 
Arg 


CGC 
Arg 


CAC 
His 


432 


H3TC 
^al 

Si 45 


GAT 
Asp 


TTG 
Leu 


CTC 
Leu 


GTT 
Val 


GGG 
Gly 
150 


GCG 
Ala 


GCT 
Ala 


GCT 
Ala 


TTC 
Phe 


TGT 
Cys 
155 


TCC 
Ser 


GCT 
Ala 


ATG 
Met 


TAC 
Tyr 


GTG 
Val 
160 


480 


MSGG 
Jtely 


GAT 
Asp 


CTC 
Leu 


TGC 
Cys 


GGA 
Gly 
165 


TCT 
Ser 


GTC 
Val 


TTC 
Phe 


CTC 
Leu 


GTC 
Val 
170 


TCC 
Ser 


CAG 
Gin 


CTG 
Leu 


TTC 
Phe 


ACC 
Thr 
175 


ATC 
He 


528 


rtcG 


CCT 
Pro 


CGC 
Arg 


CGG 
Arg 
180 


CAT 
His 


GAG 
Glu 


ACG 
Thr 


GTG 
Val 


CAG 
Gin 
185 


GAC 
Asp 


TGC 
Cys 


AAT 
Asn 


TGC 
Cys 


TCA 
Ser 
190 


ATC 
He 


TAT 
Tyr 


576 


Sccc 

C?ro 


GGC 
Gly 


CAC 
His 
195 


ATA 
He 


ACA 
Thr 


GGT 
Gly 


CAC 
His 


CGT 
Arg 
200 


ATG 
Met 


GCT 
Ala 


TGG 
Trp 


GAT 
Asp 


ATG 
Met 
205 


ATG 
Met 


ATG 
Met 


.AAC 
Asn 


624 



H:gg TAATAG 633 
Mp r p 

210 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 209 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
15 10 15 

Met Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 



HCVENV.-T 



Leu Ser Cys Leu Thr lie Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 
85 90 95 

Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His 
130 135 140 

Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val 
145 150 155 160 

lily Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr He 
165 170 175 

"%er Pro Arg Arg His Glu Thr Val Gin Asp Cys Asn Cys Ser He Tyr 

=£1 180 185 190 

g>ro Gly His lie Thr Gly His Arg Met Ala- Trp Asp Met Met Met Asn 
r; 195 200 205 

LTrp 



fj(2) INFORMATION FOR SEQ ID NO: 9: 

3i (i) SEQUENCE CHARACTERISTICS: 

~ : (A) LENGTH: 483 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
. (iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..480 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 1..477 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

ATG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCC CTG CTG TCC TGT 
Met Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu Leu Ser Cys 
15 10 15 



CTG ACC ATA CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG TCC GGG GTG 



•» it 



Leu Thr He Pro Ala Ser Ala Tyr Glu Val Arg Asn Val Ser Gly Val 
20 25 30 

TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATA GTG TAT GAG GCA 144 
Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val Tyr Glu Ala 
35 40 45 

GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC GTT CGG GAG 192 
Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys Val Arg Glu 
50 55 60 

GGC AAC TCC TCC CGT TGC TGG GTG GCG CTC ACT CCC ACG CTC GCG GCC 240 
Gly Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala 
65 70 75 80 

AGG AAC GCC AGC GTC CCC ACA ACG ACA ATA CGA CGC CAC GTC GAT TTG 288 
Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His Val Asp Leu 
_ 8 5 90 95 

^CfcTC GTT GGG GCT GCT GCT TTC TGT TCC GCT ATG TAC GTG GGG GAT CTC 336 

ffiteu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val Gly Asp Leu 
=£; 100 105 110 

=™[TGC GGA TCT GTT TTC CTT GTT TCC CAG CTG TTC ACC TTC TCA CCT CGC 38 4 

Sbys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr Phe Ser Pro Arg 

U 115 120 125 

s'cGG CAT CAA ACA GTA CAG GAC TGC AAC TGC TCA ATC TAT CCC GGC CAT 4 32 

^'Arg His Gin Thr Val Gin Asd Cys Asn Cys Ser He Tyr Pro Gly His 
C| 130 135 140 

E!gTA TCA GGT CAC CGC ATG GCT TGG GAT ATG ATG ATG AAC TGG TCC TAATAG 483 

y: Val Ser Gly His Arg Met Ala Trp Asd Met Met Met Asn Trp Ser 
C 145 150 155 160 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu Leu Ser Cys 
15 10 15 

Leu Thr He Pro Ala Ser Ala Tyr Glu Val Arg Asn Val Ser Gly Val 
20 25 30 

Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val Tyr Glu Ala 
35 40 45 

Ala Asd Met He Met His Thr Pro Gly Cys Val Pro Cys Val Arg Glu 
50 55 60 

Gly Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala 
65 70 75 80 



HCVENV.-T 



Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His Val Asp Leu 
85 90 95 

Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val Gly Asp Leu 
100 105 110 

Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr Phe Ser Pro Arg 
115 120 125 

Arg His Gin Thr Val Gin Asp Cys Asn Cys Ser He Tyr Pro Gly His 
130 135 140 

Val Ser Gly His Arg' Met Ala Trp Asp Met Met Met Asn Trp Ser 
145 150 155 



: (2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 480 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE : 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..477 

(ix) FEATURE: 

(A) NAME /KEY : mat_pept ide 

(B) LOCATION: 1. .474 



(xi) SEQUENCE DESCRIPTION: SEQ 



ATG 


TCC 


GGT 


TGC 


TCT 


TTC 


TCT 


ATC 


TTC 


Met 


Ser Gly 


Cys 


Ser 


Phe 


Ser 


He 


Phe 


1 








5 










CTG 


ACC 


ATA 


CCA 


GCT 


TCC 


GCT 


TAT 


GAA 


Leu 


Thr 


He 


Pro 


Ala 


Ser 


Ala 


Tyr 


Glu 








20 










25 


TAC 


CAT 


GTC 


ACG 


AAC 


GAC 


TGC 


TCC 


AAC 


Tyr 


His 


Val 


Thr 


Asn 


Asp 


Cys 


Ser 


Asn 




35 










40 




GCG 


GAC 


ATG 


ATC 


ATG 


CAC 


ACC 


CCC 


GGG 


Ala 


Asd 


Met 


He 


Met 


His 


Thr 


Pro 


Gly 




50 










55 






GGC 


AAC 


TCC 


TCC 


CGT 


TGC 


TGG 


GTG 


GCG 


Gly 


Asn 


Ser 


Ser Arg 


Cys 


Trp 


Val 


Ala 


65 










70 









D NO: 11: 



CTC 


TTG 


GCC 


CTG 


CTG 


TCC 


TGT 


48 


Leu 


Leu 


Ala 


Leu 


Leu 


Ser Cys 




10 










15 






GTG 


CGC 


AAC 


GTG 


TCC 


GGG 


GTG 


96 


Val 


Arg 


Asn 


Val Ser Gly Val 












30 








TCA 


AGC 


ATA 


GTG 


TAT 


GAG 


GCA 


144 


Ser 


Ser 


He 


Val 


Tyr 


Glu 


Ala 










45 










TGC 


GTG 


CCC 


TGC 


GTT 


CGG 


GAG 


192 


Cys 


Val 


Pro 


Cys 


Val 


Arg 


Glu 








60 












CTC 


ACT 


CCC 


ACG 


CTC 


GCG 


GCC 


240 


Leu 


Thr 


Pro 


Thr 


Leu 


Ala 


Ala 






75 










80 
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AGG AAC GCC AGC GTC CCC ACA ACG ACA ATA CGA CGC CAC GTC GAT TTG 288 

Arg Asn Ala. Ser Val Pro Thr Thr Thr He Arg Arg His Val Asp Leu 
85 90 ' 95 

CTC GTT GGG GCT GCT GCT TTC TGT TCC GCT ATG TAG GTG GGG GAT CIC 336 

Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val Gly Asp Leu 
100 105 HO 

TGC GGA TCT GTT TTC CTT GTT TCC CAG CTG TTC ACC TTC TCA CCT CGC 384 

Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr Phe Ser Pro Arg 
115 120 125 

CGG CAT CAA ACA GTA CAG GAC TGC AAC TGC TCA ATC TAT CCC GGC CAT 432 

Arg His Gin Thr Val Gin Asp Cys Asn Cys Ser He Tyr Pro Gly His 

130 135 140 

f=GTA TCA GGT CAC CGC ATG GCT TGG GAT ATG ATG ATG AAC TGG TAATAG 

""2Val Ser Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp 

^L45 150 155 

ip(2) INFORMATION FOR SEQ ID NO: 12: 



(i) SEQUENCE CHARACTERISTICS: 
J~ (A) LENGTH: 158 amino acids 

w (B) TYPE: amino acid 

s _ (D) TOPOLOGY: linear 

\j (ii) MOLECULE TYPE: protein 

5: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

4-let Ser Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu Leu Ser Cys 
H= 15 10 15 

Leu Thr He Pro Ala Ser Ala Tyr Glu Val Arg Asn Val Ser Gly Val 
20 25 30 

Tyr His Val Thr Asn Asd Cys Ser Asn Ser Ser He Val Tyr Glu Ala 
35 " 40 45 

Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys Val Arg Glu 

50 55 60 

Gly Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala 

65 70 75 80 

Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His Val Asp Leu 
85 90 95 

Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val Gly Asp Leu 
100 105 110 

Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr Phe Ser Pro Arg 
115 120 125 

Arg His Gin Thr Val Gin Asp Cys Asn Cys Ser He Tyr Pro Gly His 
130 135 140 

Val Ser Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp 
145 150 155 
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(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 636 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(ix) FEATURE: 
r } (A) NAME /KEY : CDS 

'y (B) LOCATION: 1. . 633 

□ (ix) FEATURE : 

H (A) NAME /KEY: mat_peptide 

" (3) LOCATION: 1..630 

iLi 

^ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

IATG CTG GGT AAG GCC ATC GAT ACC CTT ACG TGC GGC TTC GCC GAC CTC 48 
^Met Leu Gly Lys Ala He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
1 5 10 15 

p3TG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 96 

^/al Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 

* 20 25 30 

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAA GAC GGC GTG AAC TAT GCA 14 4 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 

35 40 45 

ACA. GGG AAT TTG CCT GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTA 192 
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 

CTG TCC TGT CTA ACC ATT CCA GCT TCC GCT TAC GAG GTG CGC AAC GTG 240 
Leu Ser Cys Leu Thr He Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

TCC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 288 
Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 
85 90 95 

TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 336 
Tyr Glu Ala Ala Asd Met He Met His Thr Pro Gly Cys Val Pro Cys 
100 ^ 105 110 

GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 38 4 

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

CTC GCG GCT AGG AAC GCC AGC ATC CCC ACT ACA ACA ATA CGA CGC CAC 432 
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Leu Ala Ala Arg Asn Ala Ser He Pro Thr Thr Thr He Arg Arg His 

130 135 140 

GTC GAT TTG CTC GTT GGG GCG GCT GCT TTC TGT TCC GCT ATG TAG GTG 480 

Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val 

145 150 155 160 

GGG GAT CTC TGC GGA TCT GTC TTC CTC GTC TCC CAG CTG TTC ACC ATC 528 

Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr He 

165 170 175 

TCG CCT CGC CGG CAT GAG ACG GTG CAG GAC TGC AAT TGC TCA ATC TAT 57 6 

Ser Pro Arg Arg His Glu Thr Val Gin Asp Cys Asn Cys Ser He Tyr 

180 185 190 

CCC GGC CAC ATA ACG GGT CAC CGT ATG GCT TGG GAT ATG ATG ATG AAC 624 

Pro Gly His He Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn 
195 200 205 



[ TGG TAC TAATAG 
1 Trp Tyr 
210 



! (2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Leu Gly Lys Ala He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
15 10 15 

Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 

Leu Ser Cys Leu Thr He Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 
85 90 95 

Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

Leu Ala Ala Arg Asn Ala Ser He Pro Thr Thr Thr He Arg Arg His 
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130 135 140 

Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val 
145 150 155 160 

Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr lie 
165 170 175 

Ser Pro Arg Arg His Glu Thr Val Gin Asp Cys Asn Cys Ser He Tyr 
180 185 190 

Pro Gly His He Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn 
195 200 205 

Trp Tyr 
210 

%{2) INFORMATION FOR SEQ ID NO: 15: 

W (i) SEQUENCE CHARACTERISTICS: 
s0 (A) LENGTH: 2 6 base pairs 

Vg (B) TYPE: nucleic acid 

yj (C) STRANDEDNESS :■ single 

Pi (D) TOPOLOGY: linear 

W (ii) MOLECULE TYPE: cDNA 

D (iii) HYPOTHETICAL: NO 

rj (iii) ANTI-SENSE : NO 

ffi 

N : (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

ATGCCCGGTT GCTCTTTCTC TATCTT 26 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
ATGTTGGGTA AGGTCATCGA TACCCT 26 
(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 

CTATTAGGAC CAGTTCATCA TCATATCCCA 

H2) INFORMATION FOR SEQ ID NO: 18: 

~ j (i) SEQUENCE CHARACTERISTICS: 

^ (A) LENGTH: 27 base pairs 

\D (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
yj (D) TOPOLOGY: linear 

j~ (ii) MOLECULE TYPE: cDNA 

s _ (iii) HYPOTHETICAL: NO 

D 

S! (iii) ANTI-SENSE: YES 



si (xi) SEQUENCE DESCRIPTION: SEQ ID NO 
CTATTACCAG TTCATCATCA TATCCCA 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 

ATACGACGCC ACGTCGATTC CCAGCTGTTC ACCATC 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 36 base pairs 



m 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

GATGGTGAAC AGCTGGGAAT CGACGTGGCG TCGTAT 36 

(2) INFORMATION FOR SEQ ID NO: 21: 

f| (i) SEQUENCE CHARACTERISTICS: 
~ (A) LENGTH: 723 base pairs 

r (B) TYPE: nucleic acid 

3 (C) STRANDEDNESS: single 

Q (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1. .720 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 1..717 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

ATG TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC GGC TTC GCC GAC CTC 48 

Met Leu Gly Lys Val lie Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 

15 10 15 

GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 96 

Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 

20 25 30 

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA 14 4 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 

35 40 45 

ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG 192 

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 

50 55 60 

CTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG 240 

Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 

65 70 75 80 
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TCC 


GGG 


ATG 


TAG 


CAT 


GTC 


ACCj 


B. BP 

AAC 


PAP 


TP.P 


TPP 


AAC 


TCA 


ACC 


ATT 


GTG 


288 




Ser 


Gly 


Met 


Tyr 


nlS 


vai 


TH y- 

i nr 




El c o 


P \r o 

v-f y o 




Asn 


Ser 


Ser 


He 


Val 










85 










90 










95 








TAT 


GAG 


GCA 


GCG 


GAL 


A TP 

Alb 


AIL 


JVTP 
A 1 Vj 


L-AL. 


app 


ppp 


GGG 


TGC 


r,Tn 

u 1 u 


CCC 


TGC 


336 




Tyr 


Glu 


Ala 


Ala 


Asp 




T 1 « 

lie 


tic t. 




Thr 


Orrt 


Gly Cys 


Val 


Pro 


Cys 










100 










105 










110 










GTT 


CGG 


GAG 


AAC 


AAC 


TCT 


TCC 


CGC 


TGC 


TGG 


GTA 


ccc 


PTP 


ACC 


CCC 


ACG 


384 




val 


Arg 


CjlU 


Asn 


Asn 


Car 






P i/c 


lrp 


Val 

V CI X 


Ala 


Leu 


Thr 


Pro 


Thr 








lie; 










1 41. U 










125 












CTC 


GCA 


GCT 


AGG 


AAC 


GCC 


AGC 


GTC 


ccc 


ACC 


ACG 


apa 
ALA 


Al A 


CGA 


CGC 


CAC 


432 




Leu 


Ala 


Ala 


Arg 


Asn 


ax a 


oer 


Vai 


t L U 


Th r 

L 111. 


Thr 


Thr 


He 


Arg Arg 


His 








130 








135 










140 














GTC 


GAT 


TCC 


CAG 


CTG 


TTC 


ACC 


ATC 


TCG 


CCT 


CGC 


CGG 


CAT 


GAG 


ACb 


GTG 


480 




val 


Asp 


C A -K* 

ber 


C \ r\ 

bin 


Leu 


Dho 

rue 


Th r 

1 ill. 


lie 


oca. 






Arg 


His 


p 1 1 1 
Glu 


inr 


Val 






145 


















155 










160 






CAG 


GAC 


TGC 


AAT 


TGC 


TCA 


ATC 


TAT 


CCC 


GGC 


CAC 


ATA 


ACG 


GGT 


CAC 


CGT 


528 




Gin 


Asp 


Cys 


Asn 


Cys 


ber 


lie 


Tyr 


O t- r>\ 

rlu 


p.1 v 




He 


Thr 


CI v 


Hi q 
n -l o 


Arcr 










loo 










i 7 n 

1 / u 










175 








ATG 


GCT 


TGG 


GAT 


ATG 


ATG 


ATG 


AAC 


TGG 


TCG 


CCT 


ACA 


ACG 


GCC 


CTG 


GTG 


576 


§ 3 ;Met 


Ala 


Trp 


Asp 


Met 


Met 


Met 


Asn 


Trp 


oer 


Pro 


Thr 


Thr 


Ala 


Leu 


V CI -L 




3 






180 










10 3 










190 










GTA 


TCG 


CAG 


CTG 


CTC 


CGG 


ATC 


CCA 


CAA 


GCT 


GTC 


CTC 




ATG 


GTG 


GCG 


624 




Val 


Ser 


Gin 


Leu 


Leu 


Arg 


Ti- 
ne 


Pro 




Zl 1 a 
Aid 


V ci X 


Val 


Asd 


Men 


Val 


Ala 










195 


















205 










w 


GGG 


ppp 


L.A1 


iprr 

1 


vj<jA 


PTP 


CTC 






CTC 


GCC 


TAC 


TAT 


TCC 


ATG 


GTG 


672 




Gly 


Ala 


His 


Trp 


Gly 


Val 


Leu 


Ala 


Gly 


Leu 


Ala 


Tyr 


Tyr 


Ser 


Met 


Vai 






210 








215 










220 














GGG 


AAC 


TGG 


GCT 


AAG 


GTT 


TTG 


ATT 


GTG 


ATG 


CTA 


CTC 


TTT 


GCT 


CCC 


TAATAG 


723 




Gly 


Asn 


Trp 


Ala 


Lys 


Val 


Leu 


He 


Val 


Met 


Leu 


Leu 


Phe 


Ala 


Pro 


240 






225 






230 










235 














(2) 


INFORMATION 


FOR 


SEQ 


ID i 


NO: 


22: 



















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 239 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Met Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
15 10 15 

Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 
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Thr Gly Asn 
50 

Leu Ser Cys 
65 

Ser Gly Met 



Tyr Glu Ala 



Val Arg Glu 
115 

Leu Ala Ala 
130 

Oval Asp Ser 
=£114 5 

ffiGln Asp Cys 



^Met Ala Trp 



= Val Ser Gin 

_; ! Gly Ala His 
U 210 

OGly Asn Trp 
U,225 



Leu Pro Gly Cys Ser Phe Ser 
55 

Leu Thr Val Pro Ala Ser Ala 
70 

Tyr His Val Thr Asn Asp Cys 

85 90 



He Phe 
60 

Tyr Glu 
75 



Leu Leu Ala Leu 

Val Arg Asn Val 
80 



Ser Asn Ser Ser He Val 
95 



Ala Asp Met He Met 
100 

Asn Asn Ser Ser Arg 
120 



His Thr 
105 



Pro Gly 
Cys Trp Val Ala 



Arg Asn Ala Ser Val Pro Thr 
135 

Gin Leu Phe Thr He Ser Pro 
150 

Asn Cys Ser He Tyr Pro Gly 
165 170 

Asd Met Met Met Asn Trp Ser 
180 185 

Leu Leu Arg He Pro 
200 

Trp Gly Val Leu Ala Gly Leu 
215 

Ala Lys Val Leu He Val Met 
230 



Thr Thr 
140 

Arg Arg 
155 

His He 



Cys Val Pro Cys 
110 

Leu Thr Pro Thr 
125 

He Arg Arg His 



His Glu Thr Val 
160 

Thr Gly His Arg 
175 



Pro Thr Thr Ala Leu Val 
190 



Gin Ala Val Val 



Ala Tyr 
220 

Leu Leu 
235 



Asp Met Val Ala 
205 

Tyr Ser Met Val 



Phe Ala Pro 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 561 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..558 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 1..555 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 



ATG TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC GGC TTC GCC GAC CTC 48 
Met Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
15 10 15 

GTG GGG TAG ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 96 
Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA 14 4 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG 192 
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu' Leu Ala Leu 
50 55 60 

GcTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG 240 
€iLeu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
ffi 65 70 75 ( 80 

"^TCC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 28 8 

: b 'Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser lie Val 
^ j 85 90 95 

UJtAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 336 
2 Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 

~!GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 384 
yval Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
Ui 115 120 125 

o 

y,CTC GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC 4 32 

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His 
130 135 140 

GTC GAT TCC CAG CTG TTC ACC ATC TCG CCT CGC CGG CAT GAG ACG GTG 480 
Val Asp Ser Gin Leu Phe Thr He Ser Pro Arg Arg His Glu Thr Val 
145 150 155 160 

CAG GAC TGC AAT TGC TCA ATC TAT CCC GGC CAC ATA ACG GGT CAC CGT 528 
Gin Asp Cys Asn Cys Ser He Tyr Pro Gly His He Thr Gly His Arg 
165 170 175 

ATG GCT TGG GAT ATG ATG ATG AAC TGG TAATAG 561 
Met Ala Trp Asp Met Met Met Asn Trp 
180 185 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 185 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
Met Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 



15 10 15 

Val Gly Tyr lie Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 * 30 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 

Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 
85 90 95 

©Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
=£; 100 105 110 

r?Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
£j 115 120 125 

^Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His 
U 130 135 140 

111 

3~Val Asp Ser Gin Leu Phe Thr He Ser Pro Arg Arg His Glu Thr Val 
j«,14 5 150 155 160 

i j Gln Asp Cys Asn Cys Ser He Tyr Pro Gly His He Thr Gly His Arg 
U 165 170 175 

rr\ 

CiMet Ala Trp Asp Met Met Met Asn Trp 
y : 180 185 



(2) INFORMATION FOR SEQ ID NO: 25: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 606 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..603 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(3) LOCATION: 1 . . 600 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



ATG TTG GGT AAG GTC ATC 
Met Leu Gly Lys Val lie 
1 5 

GTG GGG TAC ATT CCG CTC 
Val Gly Tyr lie Pro Leu 
20 

GCC CTG GCG CAT GGC GTC 
Ala Leu Ala His Gly Val 
35 

ACA GGG AAT TTG CCC GGT 
Thr Gly Asn Leu Pro Gly 
50 

CTG TCC TGT CTG ACC GTT 
CLeu Ser Cys Leu Thr Val 
J 65 70 

TfjCC GGG ATG TAC CAT GTC 
iSer Gly Met Tyr His Val 

&AT GAG GCA GCG GAC ATG 
bjTyr Glu Ala Ala Asd Met 
100 

H;tt cgg gag aac aac tct 
3/al Arg Glu Asn Asn Ser 
U 115 

fZCTC GCA GCT AGG AAC GCC 
yLeu Ala Ala Arg Asn Ala 
130 

GTC GAT TCC CAG CTG TTC 
Val Asd Ser Gin Leu Phe 
145 ' 150 
CAG GAC TGC AAT TGC TCA 
Gin Asp Cys Asn Cys Ser 
165 

ATG GCT TGG GAT ATG ATG 
Met Ala Trp Asp Met Met 
130 

GTA TCG CAG CTG CTC CGG 
Val Ser Gin Leu Leu Arg 
195 



GAT 


ACC 


CTT 


ACA 


TGC 


GGC 


Asp 


Thr 


Leu 


Thr 


Cys 


Gly 






10 






GTC 


GGC 


GCC 


CCC 


CTA 


GGG 


Val 


Gly 


Ala 


Pro 


Leu Gly 






25 








CGG 


GTT 


CTG 


GAG 


GAC 


GGC 


Arg 


Val 


Leu 


Glu 


Asp Gly 




40 










TGC 


TCT 


TTC 


TCT 


ATC 


TTC 


Cys 


Ser 


Phe 


Ser 


He 


Phe 


55 










60 


CCA 


GCT 


TCC 


GCT 


TAT 


GAA 


Pro 


Ala 


Ser 


Ala 


Tyr Glu 










75 




ACG 


AAC 


GAC 


TGC 


TCC 


AAC 


Thr 


Asn 


Asp 


Cys 


Ser 


Asn 








90 






ATC 


ATG 


CAC 


ACC 


CCC 


GGG 


He 


Met 


His 


Thr 


Pro 


Gly 






105 








TCC 


CGC 


TGC 


TGG 


GTA 


GCG 


Ser 


Arg 


Cys 


Trp 


Val 


Ala 




120 










AGC 


GTC 


CCC 


ACC 


ACG 


ACA 


Ser 


Val 


Pro 


Thr 


Thr 


Thr 


135 










140 


ACC 


ATC 


TCG 


CCT 


CGC 


CGG 


Thr 


He 


Ser 


Pro 


Arg 


Arg 










155 




ATC 


TAT 


CCC 


GGC 


CAC 


ATA 


He 


Tyr 


Pro 


Gly 


His 


He 








170 






ATG 


AAC 


TGG 


TCG 


CCT 


ACA 


Met 


Asn 


Tro 


Ser 


Pro 


Thr 






185 








ATC 


CTC 


TAATAG 






He 


Leu 











200 



TTC GCC GAC CTC 48 
Phe Ala Asp Leu 
15 

GGC GCT GCC AGG 96 
Gly Ala Ala Arg 
30 

GTG AAC TAT GCA 14 4 

Val Asn Tyr Ala 
45 

CTC TTG GCT TTG 192 
Leu Leu Ala Leu 



GTG CGC AAC GTG 24 0 

Val Arg Asn Val 
80 

TCA AGC ATT GTG 288 
Ser Ser He Val 
95 

TGC GTG CCC TGC 336 
Cys Val Pro Cys 
110 

CTC ACC CCC ACG 38 4 

Leu Thr Pro Thr 

125 

ATA CGA CGC CAC 4 32 

He Arg Arg His 



CAT GAG ACG GTG 4 80 

His Glu Thr Val 
160 

ACG GGT CAC CGT 528 
Thr Gly His Arg 
175 

ACG GCC CTG GTG 57 6 

Thr Ala Leu Val 
190 

606 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 200 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



HCVENV-T 





(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Met Leu Gly Lys Val He Aso Thr Leu Thr Cys Gly Phe Ala Asp Leu 
1 5 ' 10 15 

Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 

Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

CSer Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 

*fi 85 90 95 

rir : 

^Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
't! 100 105 110 

IWal Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
G 115 120 125 

i s I 

r*Leu Ala Ala Arg Asn Ala Ser Val Pro ' Thr Thr Thr He Arg Arg His 
L : 130 135 140 

a i M i 

^Val Asd Ser Gin Leu Phe Thr He Ser Pro Arg Arg His Glu Thr Val 
Ul45 ^ 150 155 160 

DP 

DGln Asd Cys Asn Cys Ser He Tyr Pro Glv His He Thr Gly His Arg 
P, * 165 170 175 

Met Ala Trp Asd Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val 
180 135 190 

Val Ser Gin Leu Leu Arg He Leu 
195 200 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 636 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..633 



un/cw\/ _T 



(ixj* FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 1..630 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 



ATG TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC GGC TTC GCC GAC CTC 
Met Leu Gly Lys Val lie Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
15 10 15 



!CTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG 
Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 



CAT CAC TAATAG 
His His 



48 



GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 96 

Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 

20 25 30 

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA 14 4 

^Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 

G 35 40 45 

.f; 

fflACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG 192 

jfiThr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 

,?! *" 50 55 60 



240 



pTCC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 288 
^iSer Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 
«I 85 90 95 



zJ TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 336 
C j Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
U 100 105 110 

GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 384 
Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

CTC GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC 4 32 

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His 
130 135 140 

GTC GAT TCC CAG CTG TTC ACC ATC TCG CCT CGC CGG CAT GAG ACG GTG 480 
Val Asd Ser Gin Leu Phe Thr He Ser Pro Arg Arg His Glu Thr Val 
145 ^ 150 155 160 

CAG GAC TGC AAT TGC TCA ATC TAT CCC GGC CAC ATA ACG GGT CAC CGT 528 
Gin Asp Cys Asn Cys Ser He Tyr Pro Gly His He Thr Gly His Arg 
165 170 175 

ATG GCT TGG GAT ATG ATG ATG AAC TGG TCG CCT ACA ACG GCC CTG GTG 57 6 

Met Ala Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val 
180 185 190 

GTA TCG CAG CTG CTC CGG ATC GTG ATC GAG GGC AGA CAC CAT CAC CAC 624 
Val Ser Gin Leu Leu Arg He Val He Glu Gly Arg His His His His 
195 200 205 



636 



210 



(2) INFORMATION FOR SEQ ID NO: 28: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Met Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
15 10 15 

Wal Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
£i 20 25 30 

ff= 

a Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

4rhr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
U so 55 60 

= Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
q65 70 75 80 

Jer Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 
U 85 90 95 

if?": 

Qyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
L : 100 105 110 

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His 
130 135 140 

Val Asp Ser Gin Leu Phe Thr He Ser Pro Arg Arg His Glu Thr Val 
145 150 155 160 

Gin Asd Cys Asn Cys Ser He Tyr Pro Gly His He Thr Gly His Arg 
165 170 175 

Met Ala Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val 
180 185 190 

Val Ser Gin Leu Leu Arg He Val He Glu Gly Arg His His His His 
195 200 205 

His His 
210 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 630 base pairs 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..627 

(ix) FEATURE: 
c = (A) NAME /KEY : mat_peptide 

(B) LOCATION: 1..624 

IX! 

yp (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

iTjATG GGT AAG GTC ATC GAT ACC CTT ACG TGC GGA TTC GCC GAT CTC ATG 4 3 

?i Met Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu Met 
r; 1 5 10 15 

^GGG TAC ATC CCG CTC GTC GGC GCT CCC GTA GGA GGC GTC GCA AGA GCC 96 
%JGly Tyr He Pro Leu Val Gly Ala Pro Val Gly Gly Val Ala Arg Ala 
Si 20 25 30 

ScTT GCG CAT GGC GTG AGG GCC CTT GAA GAC GGG ATA AAT TTC GCA ACA 14 4 

-Leu Ala His Gly Val Arg Ala Leu Glu Asp Gly He Asn Phe Ala Thr 
^ 35 40 45 

GGG AAT TTG CCC GGT TGC TCC TTT TCT ATT TTC CTT CTC GCT CTG TTC 192 
Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu Phe 
50 55 60 

TCT TGC TTA ATT CAT CCA GCA GCT AGT CTA GAG TGG CGG AAT ACG TCT 24 0 

Ser Cys Leu He His Pro Ala Ala Ser Leu Glu Trp Arg Asn Thr Ser 
65 70 75 80 

GGC CTC TAT GTC CTT ACC AAC GAC TGT TCC AAT AGC AGT ATT GTG TAC * 288 

Gly Leu Tyr Val Leu Thr Asn Asp Cys Ser Asn Ser Ser He Val Tyr 
85 90 95 

GAG GCC GAT GAC GTT ATT CTG CAC ACA CCC GGC TGC ATA CCT TGT GTC 336 
Glu Ala Asp Asp Val He Leu His Thr Pro Gly Cys He Pro Cys Val 
100 105 110 

CAG GAC GGC AAT ACA TCC ACG TGC TGG ACC CCA GTG ACA CCT ACA GTG 384 
Gin Asp Gly Asn Thr Ser Thr Cys Trp Thr Pro Val Thr Pro Thr Val 
115 120 125 

GCA GTC AAG TAC GTC GGA GCA ACC ACC GCT TCG ATA CGC AGT CAT GTG 432 
Ala Val Lys Tyr Val Gly Ala Thr Thr Ala Ser He Arg Ser His Val 
130 135 140 

GAC CTA TTA GTG GGC GCG GCC ACG ATG TGC TCT GCG CTC TAC GTG GGT 480 
Asd Leu Leu Val Gly Ala Ala Thr Met Cys Ser Ala Leu Tyr Val Gly 
145 150 155 160 



CP 



GAC ATG TGT GGG GCT GTC TTC CTC GTG GGA CAA GCC TTC ACG TTC AGA 528 

Asd Met Cys Gly Ala Val Phe Leu Val Gly Gin Ala Phe Thr Phe' Arg 

165 170 175 

CCT CGT CGC CAT CAA ACG GTC CAG ACC TGT AAC TGC TCG CTG TAC CCA 57 6 

Pro Arg Arg His Gin Thr Val Gin Thr Cys Asn Cys Ser Leu Tyr Pro 
180 185 190 

GGC CAT CTT TCA GGA CAT CGA ATG GCT TGG GAT ATG ATG ATG AAC TGG 624 

Gly His Leu Ser Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp 
195 200 205 



TAATAG 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 
Si (A) LENGTH: 208 amino acids 

=£! (B) TYPE: amino acid 

^ri (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

W (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

O Met Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu Met 
%\ 1 5 10 15 

™ ; Gly Tyr He Pro Leu Val Gly Ala Pro Val Gly Gly Val Ala Arg Ala 
20 25 30 

JKS. 

N : Leu Ala His Gly Val Arg Ala Leu Glu Asp Gly He Asn Phe Ala Thr 
35 40 45 

Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu Phe 
50 55 60 

Ser Cys Leu He His Pro Ala Ala Ser Leu Glu Trp Arg Asn Thr Ser 
65 70 75 80 

Gly Leu Tyr Val Leu Thr Asn Asp Cys Ser Asn Ser Ser He Val Tyr 
85 90 95 

Glu Ala Asp Asp Val He Leu His Thr Pro Gly Cys He Pro Cys Val 
100 105 HO 

Gin Asp Gly Asn Thr Ser Thr Cys Trp Thr Pro Val Thr Pro Thr Val 
115 120 125 

Ala Val Lys Tyr Val Gly Ala Thr Thr Ala Ser He Arg Ser His Val 
130 135 140 

Asp Leu Leu Val Gly Ala Ala Thr Met Cys Ser Ala Leu Tyr Val Gly 
145 150 155. 160 

Asp Met Cys Gly Ala Val Phe Leu Val Gly Gin Ala Phe Thr Phe Arg 
165 170 1*75 



634 



>ro Arg Arg His Gin Thr Val Gin Thr Cys Asn Cys Ser Leu Tyr Pro 
180 185 190 

Gly His Leu Ser Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp 
195 200 205 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 630 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 





(ii) 


MOLECULE TYPE: 


cDNA 




















S3 


(iii) 


HYPOTHETICAL: NO 






















w 


(iii) 


ANTI-SENSE: 


NO 
























hi 


(ix) 


FEATURE : 

(A) NAME /KEY : 

(B) LOCATION: 


CDS 
1. .627 




















ffi 


(ix) 


FEATURE : 
(A) NAME /KEY : 
(3) LOCATION: 


mat peptide 
1. .624 




















(xi) 


SEQUENC 


E DESCRIPTION: SEQ ID NO: 31: 












ATG 
Met 
1 


GGT 
Gly 


AAG 
Lys 


GTC 
Val 


ATC 
He 
5 


GAT 
Asp 


ACC 
Thr 


CTA 
Leu 


ACG 
Thr 


TGC 
Cys 
10 


GGA 
Gly 


TTC 
Phe 


GCC 
Ala 


GAT 
Asp 


CTC 
Leu 
15 


ATG 
Met 


48 


GGG TAT 
Gly Tyr 


ATC 
He 


CCG 
Pro 
20 


CTC 
Leu 


GTA 
Val 


GGC 
Gly 


GGC 
Gly 


CCC 
Pro 
25 


ATT 
He 


GGG 
Gly 


GGC GTC GCA AGG 
Gly Val Ala Arg 
30 


GCT 
Ala 


96 


CTC 
Leu 


GCA 
Ala 


CAC 
His 
35 


GGT 
Gly 


GTG 
Val 


AGG 
Arg 


GTC 
Val 


CTT 
Leu 
40 


GAG 
Glu 


GAC 
Asp 


GGG 
Gly 


GTA 
Val 


AAC 
Asn 
45 


TAT GCA 
Tyr Ala 


ACA 
Thr 


144 


GGG AAT 
Gly Asn 
50 


TTA 
Leu 


CCC 
Pro 


GGT 
Gly 


TGC 
Cys 


TCT 
Ser 
55 


TTC 
Phe 


TCT 
Ser 


ATC 
He 


TTT 
Phe 


ATT 
He 
60 


CTT 
Leu 


GCT 
Ala 


CTT 
Leu 


CTC 
Leu 


192 


TCG TGT 
Ser Cys 
65 


CTG 
Leu 


ACC 
Thr 


GTT 
Val 


CCG 
Pro 
70 


GCC 
Ala 


TCT 
Ser 


GCA 
Ala 


GTT 
Val 


CCC 
Pro 
75 


TAC CGA AAT 
Tyr Arg Asn 


GCC 
Ala 


TCT 
Ser 
80 


240 


GGG 


ATT 


TAT 


CAT 


GTT 


ACC 


AAT 


GAT 


TGC 


CCA 


AAC 


TCT 


TCC 


ATA 


GTC 


TAT 


288 



31y He Tyr His Val Thr Asn Asp Cys Pro Asn Ser Ser He Val Tyr 
85 90 95 

GAG GCA GAT AAC CTG ATC CTA CAC GCA CCT GGT TGC GTG CCT TGT GTC 336 




Glu 


Ala 


Asp 


Asn 
100 


Leu 


He 


Leu 


His 


Ala 
105 


Pro Gly Cys Val 


Pro 
110 


Cys 


Val 




ATG 
Met 


ACA 

Thr 


Gly 
115 


AA1 

Asn 


PTP 

Val 


Avj 1 

Ser 


apa 

nun 

Arg 


TPP 

Cys 
120 


TGG 
Trp 


GTC 
Val 


CAA 
Gin 


ATT 
He 


ACC 
Thr 
125 


CCT 
Pro 


ACA 
Thr 


CTG 
Leu 


384 


TCA 
Ser 


GCC 
Ala 
130 


PPP 

Pro 


app 
Ser 


Leu 


PPA 

Gly 


PPA 

Ala 
135 


PTP 

Val 


ACG 
Thr 


GCT 
Ala 


CCT 
Pro 


CTT CGG 
Leu Arg 
140 


AGA 
Arg 


GCC 
Ala 


GTT 
Val 


432 


pap 

Asp 
145 


Tar* 
Tyr 


pta 

Lin 

Leu 


Ala 


Gly 


pp. p. 
Gly 
150 


PPT 

Ala 


Ala 


CTC TGC 
Leu Cys 


TCC 
Ser 
155 


GCG 
Ala 


TTA 
Leu 


TAC 
Tyr 


GTA 
Val 


GGA 
Gly 
160 


480 


nap 

OA*— 

Asp 


Ala 


TP.T 

Cys 


P.PP 

OOVJ 

Gly 


PPA 

Ala 
165 


PTA 

Leu 


TTC 
Phe 


TTG 
Leu 


GTA GGC CAA ATG 
Val Gly Gin Met 
170 


TTC 
Phe 


ACC 
Thr 


TAT 
Tyr 
175 


AGG 
Arg 


528 


— ppt 


pp.p 
Arg 


pap 
Gin 


TAP 

His 
130 


GCT 
Ala 


ACG 
Thr 


GTG 
Val 


CAG 
Gin 


AAC 
Asn 
185 


TGC 
Cys 


AAC 
Asn 


TGT 
Cys 


TCC 
Ser 


ATT 
He 

190 


TAC 
Tyr 


AGT 
Ser 


576 


WGGC 
©Gly 


CAT 
His 


GTT 
Val 
195 


ACC 
Thr 


GGC 
Gly 


CAC 
His 


CGG 
Arg 


ATG 
Met 
200 


GCA 
Ala 


TGG GAT 
Trp Asp 


ATG 
Met 


ATG 
Met 
205 


ATG 
Met 


AAC 
Asn 


TGG 
Trp 


624 


JLtaatag 






























630 



m 

Q(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Met Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu Met 
15 10 15 

Gly Tyr He Pro Leu Val Gly Gly Pro He Gly Gly 'Val Ala Arg Ala 
20 25 30 

Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala Thr 
35 40 45 

Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe He Leu Ala Leu Leu 
50 55 60 

Ser Cys Leu Thr Val Pro Ala Ser Ala Val Pro Tyr Arg Asn Ala Ser 
65 70 75 80 

Gly He Tyr His Val Thr Asn Asp Cys Pro Asn Ser Ser He Val Tyr 
85 90 95 





Glu Ala Asp Asn 
100 



Leu lie Leu His 



Ala Pro Gly Cys Val Pro Cys Val 
105 110 



Met Thr Gly Asn 
115 

Ser Ala Pro Ser 
130 

Asp Tyr Leu Ala 
145 

Asp Ala Cys Gly 



Pro Arg Gin His 
180 



Val Ser Arg Cys 
120 

Leu Gly Ala Val 
135 

Gly Gly Ala Ala 
150 

Ala Leu Phe Leu 
165 

Ala Thr Val Gin 



Trp Val Gin He Thr Pro Thr Leu 
125 

Thr Ala Pro Leu Arg Arg Ala Val 
140 

Leu Cys Ser Ala Leu Tyr Val Gly 
155 160 

Val Gly Gin Met Phe Thr Tyr Arg 
170 175 

Asn Cys Asn Cys Ser lie Tyr Ser 
185 190 



JfiGly His Val Thr 
195 



Gly His Arg Met 
200 



Ala Trp Asp Met Met Met Asn Trp 
205 



jH(2) INFORMATION FOR SEQ ID NO: 33: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



* (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

TGGGATATGA TGATGAACTG GTC 

(2) INFORMATION FOR SEQ ID NO: 34: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 




CTATTATGGT GGTAAGCCAC AGAGCAGGAG 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1476 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..1473 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 1..1470 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 



Hgg 


GAT 


ATG 


ATG 


ATG 


AAC 


TGG 


TCG 


CCT 


ACA 


ACG 


GCC 


CTG 


GTG 


GTA 


TCG 


48 


^trp Asp 


Met 


Met 


Met 


Asn 


Trp 


Ser 


Pro 


Thr 


Thr 


Ala 


Leu 


Val 


Val 


Ser 












5 










10 










15 






□SAG CTG CTC 


CGG 


ATC 


CCA 


CAA 


GCT 


GTC 


GTG 


GAC 


ATG 


GTG 


GCG 


GGG 


GCC 


96 


Man 


Leu 


Leu 


Arg 


lie 


Pro 


Gin 


Ala 


Val 


Val 


Asp 


Met 


Val 


Ala 


Gly 


Ala 










20 










25 










30 








CAT 


TGG 


GGA 


GTC 


CTG 


GCG 


GGC 


CTC 


GCC 


TAC 


TAT 


TCC 


ATG 


GTG 


GGG 


AAC 


144 


His 


Trp 


Gly 


Val 


Leu 


Ala 


Gly 


Leu 


Ala 


Tyr 


Tyr 


Ser 


Met 


Val 


Gly 


Asn 






35 










40 










45 










TGG 


GCT 


AAG 


GTT 


TTG 


GTT 


GTG 


ATG 


CTA 


CTC 


TTT 


GCC 


GGC 


GTC 


GAC 


GGG 


192 


Trp 


Ala 


Lys 


Val 


Leu 


Val 


Val 


Met 


Leu 


Leu 


Phe Ala Gly 


Val 


Asp 


Gly 




50 








55 










60 












CAT 


ACC 


CGC 


GTG 


TCA 


GGA 


GGG 


GCA 


GCA 


GCC 


TCC 


GAT 


ACC 


AGG 


GGC 


CTT 


240 


His 


Thr 


Arg 


Val 


Ser 


Gly 


Gly 


Ala 


Ala 


Ala 


Ser Asp 


Thr 


Arg Gly 


Leu 




65 








70 










75 










80 




GTG 


TCC 


CTC 


TTT 


AGC 


ccc 


GGG 


TCG 


GCT 


CAG 


AAA 


ATC 


CAG 


CTC 


GTA 


AAC 


288 


Val 


Ser 


Leu 


Phe 


Ser 


Pro 


Gly 


Ser 


Ala 


Gin 


Lys 


He 


Gin 


Leu 


Val 


Asn 












85 










90 










95 






ACC 


AAC 


GGC 


AGT 


TGG 


CAC 


ATC 


AAC 


AGG 


ACT 


GCC 


CTG 


AAC 


TGC 


AAC 


GAC 


-336 


Thr 


Asn 


Gly 


Ser 


Trp 


His 


He 


Asn 


Arg 


Thr 


Ala 


Leu 


Asn 


Cys 


Asn 


Asp 








100 








105 










110 








TCC 


CTC 


CAA 


ACA 


GGG 


TTC 


TTT 


GCC 


GCA 


CTA 


TTC 


TAC 


AAA 


CAC 


AAA 


TTC 


384 


Ser 


Leu 


Gin 


Thr 


Gly 


Phe 


Phe 


Ala 


Ala 


Leu 


Phe 


Tyr 


Lys 


His 


Lys 


Phe 








115 








120 










125 










AAC 


TCG 


TCT 


GGA 


TGC 


CCA 


GAG 


CGC 


TTG 


GCC 


AGC 


TGT 


CGC 


TCC 


ATC 


GAC 


432 



HCVENV.-T 



Asn 


Ser 


Ser 


Gly 


Cys 


Pro 


Glu Arg 


Leu 


Ala 


Ser. 


Cys 


Arg 


Ser 


He 


Asp 






130 






135 










140 












AST 


1 1L. 


PPT 


par 


GGG 


TGG 


GGT 


CCC 


CTC 


ACT 


TAC 


ACT 


GAG 


CCT 


AAC 


AGC 


480 


Lys 


Phe 


Ala 


Gin 


Gly Trp 


Gly 


Pro 


Leu 


Thr 


Tyr 


Thr 


Glu 


Pro 


Asn 


Ser 




145 










150 










155 










160 






bftL 


L-A\j 




CCC 


TAC 


TGC 


TGG 


CAC 


TAC 


GCG 


CCT 


CGA 


CCG 


TGT 


GGT 


528 


Ser 


Asp 


Gin 


Arg 


Pro 


Tyr 


Cys 


Trp 


His 


Tyr 


Ala 


Pro 


Arg 


Pro 


Cys 


Gly 








165 










170 










175 






AI 1 


o 1 A 


err 


crc 


TCT 


CAG 


GTG 


TGC 


GGT 


CCA 


GTG 


TAT 


TGC 


TTC 


ACC 


CCG 


576 


lie 


Val 


Pro 


Ala 


Ser 


Gin 


Val 


Cys 


Gly 


Pro 


Val 


Tyr 


Cys 


Phe 


Thr 


Pro 










180 










185 










190 








AuO 


1 


oil 




GTG 


GGG 


ACG 


ACC 


GAT 


CGG 


TTT 


GGT 


GTC 


CCC 


ACG 


TAT 


624 


Ser 


Pro 


Val 


Val 


Val 


Gly 


Thr 


Thr 


Asp 


Arg 


Phe 


Gly 


Val 


Pro 


Thr 


Tyr 




=«=■. 




195 








200 










205 










^AAC 




crc. 


crc 


AAC 


GAC 


TCG 


GAT 


GTG 


CTG 


ATT 


CTC 


AAC 


AAC 


ACG 


CGG 


672 




Trp 


Gly 


Ala 


Asn 


Asp 


Ser Asp 


Val 


Leu 


He 


Leu 


Asn 


Asn 


Thr 


Arg 






210 








215 










220 












yCJCG 


CCG 


CGA 


r** r*> 

GGC 


AAC 


TGG 


TTC 


GGC 








ATC 
nl u 


AAT 


GGC 


APT 


\3\3\3 


720 




Pro 


Arg 


Gly 


Asn 


Trp 


Phe 


Gly 


Cys 


Thr 


TrD 


Met 


Asn 


Gly 


Thr 


Gly 




h225 






230 










235 










240 




3 TTC 


ACC 


AAG 


ACG 


TGT 


GGG 


GGC 


CCC 


CCG 


TGC 


AAC 


ATC 


GGG 


GGG 


GCC 


GGC 


768 


&he 


Thr 


Lys 


Thr 


Cys 


Gly 


Gly 


Pro 


Pro 


Cys 


Asn 


lie 


Gly Gly 


Ala 


Gly 




"t. i 






245 










250 










255 






Sac 


AAC 


ACC 


TTG 


ACC 


TGC 


CCC 


ACT 


GAC 


TGT 


TTT 


CGG 


AAG 


CAC 


CCC 


GAG 


816 




Asn 


Thr 


Leu 


Thr 


Cys 


Pro 


Thr 


Asp 


Cys 


Phe 


Arg 


Lys 


His 


Pro 


Glu 










260 








265 










270 












Tar* 
1 AC 




AGA 


TGC 


GGT 


TCT 


ccc 


CCC 


TGG 


CTG 


ACA 


CCT 


AGG 


TGT 


864 


Ala 


Thr 


Tyr 


Ala 


Arg Cys 


Gly 


Ser 


Gly 


Pro 


Trp 


Leu 


Thr 


Pro 


Arg 


Cys 








275 










280 










285 










Al b 


(j 1 I 


cat 




CCA 


TAT 


AGG 


CTC 


TGG 


CAC 


TAC 


CCC 


TGC 


ACT 


GTC 


AAC 


912 


Met 


Val 


His 


Tyr 


Pro Tyr 


Arg 


Leu 


Trp 


His 


Tyr 


Pro 


Cys 


Thr 


Val 


Asn 






290 








295 










300 












TTC 


ACt- 


AiL 


1 I L 


AAG 


GTT 


AGG 


ATG 


TAC 


GTG 


GGG 


GGC 


GTG 


GAG 


CAC 


AGG 


960 


Phe 


Thr 


He 


Phe 


Lys 


Val 


Arg 


Met 


Tyr 


Val 


Gly 


Gly 


Val 


Glu 


His 


Arg 




305 








310 










315 










320 




TTC 


GAA 


GCC 


GCA 


TGC 


AAT 


TGG 


ACT 


CGA 


GGA 


GAG 




TGT 


GAC 


1 1 VJ 


GAG 


1008 


Phe 


Glu 


Ala 


Ala 


Cys 


Asn 


Trp Thr Arg Gly Glu 


Arg 


Cys 


Asp 


Leu 


Glu 












325 










330 










335 






GAC 


AGG 


GAT 


AGA 


TCA 


GAG 


CTT 


AGC 


CCG 


CTG 


CTG 


CTG 


TCT 


ACA 


ACA 


GAG 


1056 


Asp 


Arg 


Asp 


Arg 


Ser 


Glu 


Leu 


Ser 


Pro 


Leu 


Leu 


Leu 


Ser 


Thr 


Thr 


Glu 




340 










345 










350 








TGG 


CAG 


ATA 


CTG 


CCC 


TGT 


TCC 


TTC 


ACC 


ACC 


CTG 


CCG 


GCC 


CTA 


TCC 


ACC 


1104 


Trp 


Gin 


lie 


Leu 


Pro 


Cys 


Ser 


Phe 


Thr 


Thr 


Leu 


Pro 


Ala 


Leu 


Ser 


Thr 






355 










360 










365 










. GGC 


CTG 


ATC 


CAC 


CTC 


CAT 


CAG 


AAC 


ATC 


GTG 


GAC 


GTG 


CAA 


TAC 


CTG 


TAC 


1152 


Gly 


Leu 


He 


His 


Leu 


His 


Gin 


Asn 


He 


Val 


Asp 


Val 


Gin 


Tyr 


Leu 


Tyr 





370 375 



GGT GTA GGG TCG GCG GTT GTC TCC CTT GTC ATC AAA TGG GAG TAT GTC 1200 
Gly Val Gly Ser Ala Val Val Sex Leu Val He Lys Trp Glu Tyr Val 
385 390 395 400 

CTG TTG CTC TTC CTT CTC CTG GCA GAC GCG CGC ATC TGC GCC TGC TTA 1248 
Leu Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg He Cys Ala Cys Leu 
405 410 415 

TGG ATG ATG CTG CTG ATA GCT CAA GCT GAG GCC GCC TTA GAG AAC CTG 1296 
Trp Met Met Leu Leu He Ala Gin Ala Glu Ala Ala Leu Glu Asn Leu 
420 425 430 

GTG GTC CTC AAT GCG GCG GCC GTG GCC GGG GCG CAT GGC ACT CTT TCC 134 4 

Val Val Leu Asn Ala Ala Ala Val Ala Gly Ala His Gly Thr Leu Ser 
435 440 445 

r=-TTC CTT GTG TTC TTC TGT GCT GCC TGG TAC ATC AAG GGC AGG CTG GTC 1392 

ifphe Leu Val Phe Phe Cys Ala Ala Trp Tyr He Lys Gly Arg Leu Val 
J! 450 455 460 

CfcCT GGT GCG GCA TAC GCC TTC TAT GGC GTG TGG CCG CTG CTC CTG CTT 14 40 

Cipro Gly Ala Ala Tyr Ala Phe Tyr Gly Val Trp Pro Leu Leu Leu Leu 
h|465 470 475 480 

HcTG CTG GCC TTA CCA CCA CGA GCT TAT GCC TAGTAA 14 7 6 

^Leu Leu Ala Leu Pro Pro Arg Ala Tyr Ala 
s_ 485 490 

SI ( 2 ) INFORMATION FOR SEQ ID NO: 36: 

S| (i) SEQUENCE CHARACTERISTICS: 
'% (A) LENGTH: 4 90 amino acids 

T\ (3) TYPE: amino acid 

H : ( D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Trp Asd Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val Val Ser 
.1 5 10 15 

Gin Leu Leu Arg He Pro Gin Ala Val Val Asp Met Val Ala Gly Ala 
20 25 30 

His Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn 
35 40 45 

Trp Ala Lys Val Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly' 
50 55 60 

His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu 
65 70 75 80 

Val Ser Leu Phe Ser Pro Gly Ser Ala Gin Lys He Gin Leu Val Asn 
85 90 95 

Thr Asn Gly Ser Trp His He Asn Arg Thr Ala Leu Asn Cys Asn Asp 
100 105 110 
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Ser Leu Gin Thr Gly Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe 
115 120 125 

Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser He Asp 
130 135 140 

Lys Phe Ala Gin Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser • 
145 150 155 160 

Ser Asp Gin Arg Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly 
165 170 175 

He Val Pro Ala Ser Gin Val Cys Gly Pro Val Tyr Cys Phe Thr Pro 
180 185 190 

Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr 
195 200 205 

== j Asn Trp Glv Ala Asn Asp Ser Asp Val Leu He Leu Asn Asn Thr Arg 
D 210 * 215 220 

riPro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly 
"225 230 235 240 

Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn He Gly Gly Ala Gly 
245 250 255 

a Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu 
y 260 265 270 

-Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys 
«; 275 280 285 

WMet Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn 
U 290 295 300 

Phe Thr He Phe Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg 
305 310 315 320 

Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu . 

325 330 335 

Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu 
340 345 350 

Trp Gin He Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr 
355 360 365 

Gly Leu He His Leu His Gin Asn He Val Asp Val Gin Tyr Leu Tyr 
370 375 380 

Gly Val Gly Ser Ala Val Val Ser Leu Val He Lys Trp Glu Tyr Val 
385 390 395 400 

Leu Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg He Cys Ala Cys Leu 
405 410 415 

Trp Met Met Leu Leu He Ala Gin Ala Glu Ala' Ala Leu Glu Asn Leu 
420 425 430 

Val Val Leu Asn Ala Ala Ala Val Ala Gly Ala His Gly Thr Leu Ser 
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435 440 445 

Phe Leu Val Phe Phe Cys Ala Ala Trp Tyr He Lys Gly Arg Leu Val 
450 455 460 

Pro Gly Ala Ala Tyr Ala Phe Tyr Gly Val Trp Pro Leu Leu Leu Leu 
465 470 475 480 

Leu Leu Ala Leu Pro Pro Arg Ala Tyr Ala 
485 490 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1021 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

:; (ii) MOLECULE TYPE: cDNA 
} (iii) HYPOTHETICAL: NO 
j (iii) ANTI-SENSE : NO 

(ix) FEATURE: 

(A) NAME/ KEY : CDS 

'i <B) LOCATION:. 2. . 1018 

I 

;! (ix) FEATURE: 

\ (A) NAME /KEY : mat_peptide 

3 j (B) LOCATION: 2. . 1015 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

G ATC CCA CAA GCT GTC GTG GAC ATG GTG GCG GGG GCC CAT TGG GGA 4 6 

He Pro Gin Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly 
15 10 15 

GTC CTG GCG GGC CTC GCC TAC TAT TCC ATG GTG GGG AAC TGG GCT AAG 94 
Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys 
20 25 30 

GTT TTG GTT GTG ATG CTA CTC TTT GCC GGC GTC GAC GGG CAT ACC CGC 14 2 

Val Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly His -Thr Arg 
35 40 45 

GTG TCA GGA GGG GCA GCA GCC TCC GAT ACC AGG GGC CTT GTG TCC CTC 190 
Val Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val Ser Leu 
50 55 60 

TTT AGC CCC GGG TCG GCT CAG AAA ATC CAG CTC GTA AAC ACC AAC GGC 238 
Phe Ser Pro Gly Ser Ala Gin Lys He Gin Leu Val Asn Thr Asn Gly 
65 70 75 

AGT TGG CAC ATC AAC AGG ACT GCC CTG AAC TGC AAC GAC TCC CTC CAA 28 6 

Ser Trp His He Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Gin 
80 85 90 95 



•0 



ACA GGG TTC TTT GCC GCA 
Thr Gly Phe Phe Ala Ala 
100 

GGA TGC CCA GAG CGC TTG 
Gly Cys Pro Glu Arg Leu 
115 

CAG GGG TGG GGT CCC CTC 
Gin Gly Trp Gly Pro Leu 
130 

AGG CCC TAC TGC TGG CAC 
Arg Pro Tyr Cys Trp His 
145 

GCG TCT CAG GTG TGC GGT 
Ala Ser Gin Val Cys Gly 
160 165 



& j GTG GTG GGG ACG ACC GAT 

CiVal Val Gly Thr Thr Asp 

~f\ 130 

_2[GCG AAC GAC TCG GAT GTG 

MAla Asn Aso Ser Asp Val 
W 195 



CTA TTC TAC 
Leu Phe Tyr 



GCC AGC TGT 
Ala Ser Cys 
120 

ACT TAC ACT 
Thr Tyr Thr 
135 

TAC GCG CCT 
Tyr Ala Pro 
150 

CCA GTG TAT 
Pro Val Tyr 



CGG TTT GGT 
Arg Phe Gly 



AAA CAC AAA 
Lys His Lys 
105 

CGC TCC ATC 
Arg Ser lie 



GAG CCT AAC 
Glu Pro Asn 



OGGC AAC TGG 
vIGly Asn Trp 

«; 210 



rri 



"ACG TGT GGG 
^ ! Thr Cys Gly 
M< 225 

TTG ACC TGC 
Leu Thr Cys 
240 

GCC AGA TGC 
Ala Arg Cys 



TAC CCA TAT 
Tyr Pro Tyr 



TTC AAG GTT 
Phe Lys Val 
290 

GCA TGC AAT 
Ala Cys Asn 
305 

AGA TCA GAG 
Arg Ser Glu 
320 



TTC GGC TGT 
Phe Gly Cys 



GGC CCC CCG 
Gly Pro Pro 



CCC ACT GAC 
Pro Thr Asp 
245 

GGT TCT GGG 
Gly Ser Gly 
260 

AGG CTC TGG 
Arg Leu Trp 
275 

AGG ATG TAC 
Arg Met Tyr 



CTG ATT 
Leu lie 



ACA TGG 
Thr Trp 
215 

TGC AAC 
Cys Asn 
230 

TGT TTT 
Cys Phe 



CTC 
Leu 
200 

ATG 
Met 



ATC 
He 



CGG 
Arg 



CGA CCG TGT 
Arg Pro Cys 
155 

TGC TTC ACC 
Cys Phe Thr 
170 

GTC CCC ACG 
Val Pro Thr 
185 

AAC AAC ACG 
Asn Asn Thr 



TTC AAC 
Phe Asn 



GAC AAG 
Aso Lys 
125 

AGC TCG 
Ser Ser 
140 

GGT ATT 
Gly He 



TCG TCT 
Ser Ser 
110 

TTC GCT 
Phe Ala 



GAC CAG 
Asp Gin 



GTA CCC 
Val Pro 



CCG AGC 
Pro Ser 



TAT AAC 
Tyr Asn 



CGG CCvj 
Arg Pro 
205 



CCT GTT 
Pro Val 
175 

TGG GGG 
Tro Gly 
190 

CCG CGA 
Pro Arg 



CCC TGG CTG 
Pro Trp Leu 



TGG ACT CGA 
Trp Thr Arg 



CTT AGC CCG 
Leu Ser Pro 
325 



CAC TAC CCC 
His Tyr Pro 
280 

GTG GGG GGC 
Val Gly Gly 
295 

GGA GAG CGT 
Gly Glu Arg 
310 

CTG CTG CTG 
Leu Leu Leu 



AAT GGC ACT 
Asn Gly Thr 



GGG GGG GCC 
Gly Gly Ala 
235 

AAG CAC CCC 
Lys His Pro 
250 

ACA CCT AGG 
Thr Pro Arg 
265 

TGC ACT GTC 
Cys Thr Val 



GGG TTC 
Gly Phe 
220 

GGC AAC 
Gly Asn 



GAG GCC 
Glu Ala 



GTG GAG CAC 
Val Glu His 



TGT GAC TTG 
Cys Asp Leu 
315 

TCT ACA. ACA 
Ser Thr Thr 
330 



TGT ATG 
Cys Met 



AAC TTC 
Asn Phe 
285 

AGG TTC 
Arg Phe 
300 

GAG GAC 
Glu Asp 



GAG TGG 
Glu Trp 



ACC AAG 
Thr Lys 



AAC ACC 
Asn Thr 



ACC TAC 
Thr Tyr 
255 

GTT CAT 
Val His 
270 

ACC ATC 
Thr He 



GAA GCC 
Glu Ala 



AGG GAT 
Arg Asp 



CAG AGT 
Gin Ser 
335 



GGC AGA GCT TAATTA 



334 



382 



430 



478 



526 



574 



622 



670 



718 



766 



814 



862 



910 



958 



1006 



1021 



ur>/cw\/ T 



Gly Arg Ala 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 338 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

He Pro Gin Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val 
15 10 15 

D Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val 

*0 20 25 30 

m 

~?>Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly His Thr Arg Val 

ii 35 40 45 

I^Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val Ser Leu Phe 
O 50 55 60 

W 

3 Ser Pro Gly Ser Ala Gin Lys He Gin Leu Val Asn Thr Asn Gly Ser 
Q 65 70 75 80 

-Trp His He Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Gin Thr 
^ 85 90 95 

m 

Ody Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn Ser Ser Gly 
j** 100 105 110 

Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser He Asp Lys Phe Ala Gin 
115 120 125 

Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser Asp Gin Arg 
130 135 140 

Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly He Val Pro Ala 
145 150 155 160 

Ser Gin Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val Val 
165 170 175 

Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn Trp Gly Ala 
180 185 190 

Asn Asp Ser Asp Val Leu He Leu Asn Asn Thr Arg Pro Pro Arg Gly 
195 200 205 

Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly Phe Thr Lys Thr 
210 215 220 

Cys Gly Gly Pro Pro Cys Asn He Gly Gly Ala Gly Asn Asn Thr Leu 

225 230 235 240 

Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr Tyr Ala 

245 250 255 
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Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met Val His Tyr 
260 265 270 

Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr He Phe 
275 280 285 

Lys Vai Arg Met Tyr Val Gly Gly Val Glu His Arg Phe Glu Ala Ala 
290 295 300 

Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp Arg 
305 310 315 320 

Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp Gin Ser Gly 
325 330 335 

Arg Ala 

S|(2) INFORMATION FOR SEQ ID NO: 39: 

%B (i) SEQUENCE CHARACTERISTICS: 

yij (A) LENGTH: 1034 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

5: (ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 2.. 1032 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 2. .1029 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

G ATC CCA CAA GCT GTC GTG GAC ATG GTG GCG GGG GCC CAT TGG GGA 4 6 

He Pro Gin Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly 
15 10 15 

GTC CTG GCG GGC CTC GCC TAC TAT TCC ATG GTG GGG AAC TGG GCT AAG 94 
Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys 
20 25 30 

GTT TTG GTT GTG ATG CTA CTC TTT GCC GGC GTC GAC GGG CAT ACC CGC 142 
Val Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly His Thr Arg 
35 40 45 

GTG TCA GGA GGG GCA GCA GCC TCC GAT ACC AGG GGC CTT GTG TCC CTC 190 
Val Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val Ser Leu 
50 55 60 



TTT AGC CCC GGG 
Phe Ser Pro Gly 
65 

AGT TGG CAC ATC 
Ser Trp His He 
80 

ACA GGG TTC TTT 
Thr Gly Phe Phe 



GGA 
Gly 



CAG 
rfiln 



^AGG 
^Arg 



D GCG 
E.lAla 
; v 160 

Ogtg 

Si Val 



.:GCG 

:i Ala 



GGC 
Gly 



ACG 
Thr 



TTG 
Leu 
240 
GCC 
Ala 



TGC CCA GAG 
Cys Pro Glu 
115 

GGG TGG GGT 
Gly Trp Gly 
130 

CCC TAC TGC 
Pro Tyr Cys 
145 

TCT CAG GTG 
Ser Gin Val 



GTG GGG ACG 
Val Gly Thr 



AAC GAC TCG 
Asn Asp Ser 
195 

AAC TGG TTC 
Asn Trp Phe 
210 

TGT GGG GGC 
Cys Gly Gly 
225 

ACC TGC CCC 
Thr Cys Pro 

AGA TGC GGT 
Arg Cys Gly 



TCG GCT 
Ser Ala 



AAC AGG 
Asn Arg 
85 

GCC GCA 
Ala Ala 
100 

CGC TTG 
Arg Leu 



CCC CTC 
Pro Leu 



TGG CAC 
Trp His 



TGC GGT 
Cys Gly 
165 

ACC GAT 
Thr Asd 
180 

GAT GTG 
Asp Val 



CAG AAA ATC 
Gin Lys He 
70 

ACT GCC CTG 
Thr Ala Leu 



CTA TTC TAC 
Leu Phe Tyr 



GCC AGC TGT 
Ala Ser Cys 
120 

ACT TAC ACT 
Thr Tyr Thr 
135 

TAC GCG CCT 
Tyr Ala Pro 
150 

CCA GTG TAT 
Pro Val Tyr 



CGG TTT GGT 
Arg Phe Gly 



CAG CTC 
Gin Leu 



AAC TGC 
Asn Cys 
90 

AAA CAC 
Lys His 
105 

CGC TCC 
Arg Ser 



GAG CCT 
Glu Pro 



CGA CCG 
Arg Pro 



GGC TGT 
Gly Cys 



CCC CCG 
Pro Pro 



TAC CCA TAT AGG 
Tyr Pro Tyr Arg 
275 

TTC AAG GTT AGG 
Phe Lys Val Arg 
290 

GCA TGC AAT TGG 
Ala Cys Asn Trp 



ACT GAC 
Thr Asp 
245 
TCT GGG 
Ser Gly 
260 

CTC TGG 
Leu Trp 



ATG TAC 
Met Tyr 



ACT CGA 
Thr Arg 



CTG ATT CTC 
Leu He Leu 
200 

ACA TGG ATG 
Thr Trp Met 
215 

TGC AAC ATC 
Cys Asn He 
230 

TGT TTT CGG 
Cys Phe Arg 

CCC TGG CTG 
Pro Trp Leu 



TGC TTC 
Cys Phe 
170 

GTC CCC 
Val Pro 
185 

AAC AAC 
Asn Asn 



GTA AAC ACC 
Val Asn Thr 
75 

AAC GAC TCC 
Asn Asp Ser 



AAA TTC AAC 
Lys Phe Asn 



ATC GAC AAG 
He Asp Lys 
125 

AAC AGC TCG 
Asn Ser Ser 
140 

TGT GGT ATT 
Cys Gly He 
155 

ACC CCG AGC 
Thr Pro Ser 



ACG TAT AAC 
Thr Tyr Asn 



AAT GGC 
Asn Gly 



GGG GGG 
Gly Gly 



CAC TAC CCC 
His Tyr Pro 
280 

GTG GGG GGC 
Val Gly Gly 
295 

GGA GAG CGT 
Gly Glu Arg 



AAG CAC 
Lys His 
250 
ACA CCT 
Thr Pro 
265 

TGC ACT 
Cys Thr 



GTG GAG 
Val Glu 



TGT GAC 
Cys Asp 



ACG CGG CCG 
Thr Arg Pro 
205 

ACT GGG TTC 
Thr Gly Phe 
220 

GCC GGC AAC 
Ala Gly Asn 
235 

CCC GAG GCC 
Pro Glu Ala 

AGG TGT ATG 
Arg Cys Met 



AAC GGC 
Asn Gly 



CTC CAA 
Leu Gin 
95 

TCG TCT 
Ser Ser 
110 

TTC GCT 
Phe Ala 



GAC CAG 
Asp Gin 



GTA CCC 
Val Pro 



CCT GTT 
Pro Val 
175 

TGG GGG 
Trp Gly 
190 

CCG CGA 
Pro Arg 



GTC AAC TTC 
Val Asn Phe 
285 

CAC AGG TTC 
His Arg Phe 
300 

TTG GAG GAC 
Leu Glu Asp 



ACC AAG 
Thr Lys 



AAC ACC 
Asn Thr 



ACC TAC 
Thr Tyr 
255 
GTT CAT 
Val His 
270 

ACC ATC 
Thr He 



GAA GCC 
Glu Ala 



AGG GAT 
Arg Asp 



238 

286 

334 

382 

430 

478 

526 

574 

622 

670 

718 

766 
814 

862 

910 

958 
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305 310 315 

AGA TCA GAG CTT AGC CCG CTG CTG CTG TCT ACA ACA GGT GAT CGA GGG 1006 
Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Gly Asp Arg Gly 
320 325 330 335 

CAG ACA CCA TCA CCA CCA TCA CTA AT AG 1034 
Gin Thr Pro Ser Pro Pro Ser Leu 
340 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 3 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

! (ii) MOLECULE TYPE: protein 

* (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 



hi 1 


Pro 


Gin 


Ala 


Val 
5 


Val 


Asp 


Met 


Val 


Ala 
1U 


Gly 


Ala 


His 


Trp 


Gly 

i 0 


Val 


r%eu 


Ala 


Gly 


Leu 
20 


Ala 


Tyr 


Tyr 


Ser 


Met 
25 


Val 


Gly 


Asn TrD Ala 
30 


Lys 


Val 


CLeu 

SI 


Val 


Val 
35 


Met 


Leu 


Leu 


Phe 


Ala 
40 


Gly 


Val 


Asp 


Gly His 
45 


Thr 


Arg 


Val 


—Ser 


Gly 
50 


Gly 


Ala 


Ala 


Ala 


Ser 
55 


Asp 


Thr 


Arg 


Gly 


Leu 
60 


Val 


Ser 


Leu 


Phe 


HSer 
65 


Pro 


Gly 


Ser 


Ala 


Gin 
70 


Lys 


He 


Gin 


Leu 


Val 
75 


Asn 


Thr 


Asn 


Gly 


Ser 

80 


Trp 


His 


He 


Asn 


Arg 
85 


Thr 


Ala 


Leu 


Asn 


Cys 
90 


Asn 


Asp 


Ser 


Leu 


Gin 
95 


Thr 


Gly 


Phe 


Phe 


Ala 
100 


Ala 


Leu 


Phe 


Tyr 


Lys 
105 


His 


Lys 


Phe 


Asn 


Ser 
110 


Ser 


Gly 


Cys 


Pro 


Glu 
115 


Arg 


Leu 


Ala 


Ser 


Cys 
120 


Arg 


Ser 


He 


Asp 


Lys 
125 


Phe 


Ala 


Gin 


Gly 


Trp 
130 


Gly 


Pro 


Leu 


Thr 


Tyr 
135 


Thr 


Glu 


Pro 


Asn 


Ser 
140 


Ser 


Asp 


Gin 


Arg 


Pro 
145 


Tyr 


Cys 


Trp 


His 


Tyr 
150 


Ala 


Pro 


Arg 


Pro 


Cys 
155 


Gly 


He 


Val 


Pro 


Ala 
160 


Ser 


Gin 


Val 


Cys 


Gly 
165 


Pro 


Val 


Tyr 


Cys 


Phe 
170 


Thr 


Pro 


Ser 


Pro 


Val 
175 


Val 


Val 


Gly 


Thr 


Thr 
180 


Asp 


Arg 


Phe 


Gly 


Val 
185 


Pro 


Thr 


Tyr Asn Trp Gly Ala 
190 


Asn 


Asp 


Ser 
195 


Asp 


Val 


Leu 


He 


Leu 
200 


Asn 


Asn 


Thr 


Arg 


Pro 
205 


Pro Arg Gly 
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• Asn Trp Phe Gly 
210 

Cys Gly Gly Pro 
225 

Thr Cys Pro Thr 



Arg Cys Gly Ser 
260 

Pro Tyr Arg Leu 
275 

Lys Val Arg Met 
290 

Mlys Asn Trp Thr 
©05 

00 

~f$er Glu Leu Ser 



rirrhr Pro Ser Pro 
U 340 



Q{2) INFORMATION 

«! (i) SEQUENCE CHARACTERISTICS: 

^ (A) LENGTH: 94 5 base pairs 

3j (B) TYPE: nucleic acid 

O (C) STRANDEDNESS: single 

M< (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..942 

(ix) FEATURE: 

(A) NAME /KEY : mat_pept ide 

(B) LOCATION: 1..939 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

ATG GTG GGG AAC TGG GCT AAG GTT TTG GTT GTG ATG CTA CTC TTT GCC 4 8 

Met Val Gly Asn Trp Ala Lys Val Leu Val Val Met Leu Leu Phe Ala 

15 10 15 

GGC GTC GAC GGG CAT ACC CGC GTG TCA GGA GGG GCA GCA GCC TCC GAT 96 

Gly Val Asp Gly His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp 

20 25 30 



Cys Thr 

Pro Cys 
230 

Asp Cys 
245 

Gly Pro 
Trp His 
Tyr Val 



Arg Gly 
310 

Pro Leu 
325 



Trp Met 
215 

Asn lie 
Phe Arg 
Trp Leu 



Asn Gly 
Gly Gly 



Lys His 
250 

Thr Pro 
265 



Cys Thr 
Val Glu 
Glu Arg Cys Asp 



Tyr Pro 
280 

Gly Gly 
295 



Leu Leu 



Ser Thr 
330 



Thr Gly Phe 
220 

Ala Gly Asn 
235 

Pro Glu Ala 



Arg Cys Met 



Val Asn Phe 
285 

His Arg Phe 
300 

Leu Glu Asp 
315 

Thr Gly Asp 



Thr Lys Thr 

Asn Thr Leu 
240 

Thr Tyr Ala 
255 

Val His Tyr 
270 

Thr He Phe 
Glu Ala Ala 



Arg Asp Arg 
320 

Arg Gly Gin 
335 



Pro Ser Leu 



FOR SEQ ID NO: 41: 



ACC 
Thr 



CAG 
Gin 



AAC 
Asn 
65 

AAA 
Lys 



CGC 
.Arg 



AGG GGC 
Arg Gly 
35 

CTC GTA 
Leu Val 
50 

TGC AAC 
Cys Asn 



CAC AAA 
His Lys 



TCC ATC 
Ser He 



Egag 

*DG1u 



«:cga 

HArg 

pTGC 
k \Cys 
q145 

Jgtc 

Uval 



AAC 
Asn 



CCT AAC 
Pro Asn 
115 

CCG TGT 
Pro Cys 
130 

TTC ACC 
Phe Thr 



CCC ACG 
Pro Thr 



AAC ACG 
Asn Thr 



AAT 
Asn 



GGG 
Gly 



AAG 
Lys 
225 

ACA 
Thr 



TGC 
Cys 



GGC ACT 
Gly Thr 
195 

GGG GCC 
Gly Ala 
210 

CAC CCC 
His Pro 



CCT AGG 
Pro Arg 



ACT GTC 
Thr Val 



CTT GTG 
Leu Val 



AAC ACC 
Asn Thr 



GAC TCC 
Asp Ser 



TTC AAC 
Phe Asn 
85 

GAC AAG 
Aso Lys 
100 

AGC TCG 
Ser Ser 



GGT ATT 
Gly He 



CCG AGC 
Pro Ser 



TAT AAC 
Tyr Asn 
165. 

CGG CCG 
Arg Pro 
180 

GGG TTC 
Gly Phe 



TCC CTC 
Ser Leu 



AAC GGC 
Asn Gly 
55 

CTC CAA 
Leu Gin 
70 

TCG TCT 
Ser Ser 



TTC GCT 
Phe Ala 



GAC CAG 
Asp Gin 



GTA CCC 
Val Pro 
135 

CCT GTT 
Pro Val 
150 

TGG GGG 
Trp Gly 



CCG CGA 
Pro Arg 

ACC AAG 
Thr Lys 



GGC AAC 
Gly Asn 



GAG GCC 
Glu Ala 



TGT ATG 
Cys Met 
245 

AAC TTC 
Asn Phe 
260 



AAC ACC 
Asn Thr 
215 

ACC TAC 
Thr Tyr 
230 

GTT CAT 
Val His 



ACC ATC 
Thr He 



TTT AGC CCC 
Phe Ser Pro 
40 

AGT TGG CAC 
Ser Trp His 



ACA GGG TTC 
Thr Gly Phe 



GGA TGC CCA 
Gly Cys Pro 
90 

CAG GGG TGG 
Gin Gly Trp 
105 

AGG CCC TAC 
Arg Pro Tyr 
120 

GCG TCT CAG 
Ala Ser Gin 



GTG GTG GGG 
Val Val Gly 



GCG AAC GAC 
Ala Asn Aso 
170 

GGC AAC TGG 
Gly Asn Trp 
185 

ACG TGT GGG 
Thr Cys Gly 
200 

TTG ACC TGC 
Leu Thr Cys 



GGG TCG GCT CAG AAA ATC 
Gly Ser Ala Gin Lys He 
45 

ATC AAC AGG ACT GCC CTG 
He Asn Arg Thr Ala Leu 
60 

TTT GCC GCA CTA TTC TAC 
Phe Ala Ala Leu Phe Tyr 
75 80 

GAG CGC TTG GCC AGC TGT 
Glu Arg Leu Ala Ser Cys 
95 

GGT CCC CTC ACT TAC ACT 
Gly Pro Leu Thr Tyr Thr 
110 

TGC TGG CAC TAC GCG CCT 
Cys Trp His Tyr Ala Pro 
125 

GGT CCA GTG TAT 
Gly Pro Val Tyr 



GCC AGA TGC 
Ala Arg Cys 



TAC CCA TAT 
Tyr Pro Tyr 
250 

TTC AAG GTT 
Phe Lys Val 
265 



GTG TGC 
Val Cys 
140 

ACG ACC 
Thr Thr 
155 

TCG GAT 
Ser Asp 



TTC GGC 
Phe Gly 

GGC CCC 
Gly Pro 



CCC ACT 
Pro Thr 
220 

GGT TCT 
Gly Ser 
235 

AGG CTC 
Arg Leu 



AGG ATG 
Arg Met 



GAT CGG 
Asp Arg 



GTG CTG 
Val Leu 



TGT ACA 
Cys Thr 
190 

CCG TGC 
Pro Cys 
205 

GAC TGT 
Asp Cys 



GGG CCC 
Gly Pro 

TGG CAC 
Trp His 



TAC GTG 
Tyr Val 
270 



TTT GGT 
Phe Gly 
160 

ATT CTC 
lie Leu 
175 

TGG ATG 
Trp Met 



AAC ATC 
Asn lie 



TTT CGG 
Phe Arg 



TGG CTG 
Trp Leu 
240 

TAC CCC 
Tyr Pro 
255 

GGG GGC 
Gly Gly 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 



624 



672 



720 



768 



816 



GTG GAG CAC AGG TTC GAA GCC GCA TGC AAT TGG ACT CGA GGA GAG CGT 



864 
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Val Glu His Arg Phe Glu Ala Ala Cys Asn Trp -Thr Arg Gly Glu' Arg 
275 280 285 

TGT GAC TTG GAG GAC AGG GAT AGA TCA GAG CTT AGC CCG CTG CTG CTG 912 
Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu 
290 295 300 

TCT ACA ACA GAG TGG CAG AGC TTA ATT AAT TAG 945 
Ser Thr Thr Glu Trp Gin Ser Leu lie Asn 
305 310 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 4 

(A) LENGTH: 314 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

!! (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

—Met Val Gly Asn Trp Ala Lys Val Leu Val Val Met Leu Leu Phe Ala 
^; 1 5 10 15 

UJGly Val Asp Gly His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp 
s 20 25 30 

^Thr Arg Gly Leu Val Ser Leu Phe Ser Pro Gly Ser Ala Gin Lys He 
>| 35 40 45 

ff j Gln Leu Val Asn Thr Asn Gly Ser Trp His He Asn Arg Thr Ala Leu 
Ci 50 55 60 

Asn Cys Asn Asd Ser Leu Gin Thr Gly Phe Phe Ala Ala Leu Phe Tyr 
65 ^ 70 75 80 

Lys His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys 
85 90 95 

Arg Ser He Asp Lys Phe Ala Gin Gly Trp Gly Pro Leu Thr Tyr Thr 
100 105 110 

Glu Pro Asn Ser Ser Asd Gin Arg Pro Tyr Cys Trp His Tyr Ala Pro 
115 * 120 125 

Arg Pro Cys Gly He Val Pro Ala Ser Gin Val Cys Gly Pro Val Tyr 
130 135 140 

Cys Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly 
145 150 155 160 

Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp Ser Asp Val Leu He Leu 
165 170 175 

Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp Phe- Gly Cys Thr Trp Met 
180 185 190 

Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn He 
195 200 205 



Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg 
210 215 220 

Lys His Pro Glu Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu 
225 230 235 240 

Thr Pro Arg Cys Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro 
245 250 255 

Cys Thr Val Asn Phe Thr He Phe Lys Val Arg Met Tyr Val Gly Gly 
260 265 270 

Val Glu His Arg Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg 
275 280 285 

Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu 
p 290 295 300 

5'Ser Thr Thr Glu Trp Gin Ser Leu He Asn 
J>05 310 

J2(2) INFORMATION FOR SEQ ID NO: 43: 

S (i) SEQUENCE CHARACTERISTICS: 

U\ (A) LENGTH: 961 base pairs 

y (B) TYPE: nucleic acid 

L (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

D (ii) MOLECULE TYPE: cDNA 

Q (iii) HYPOTHETICAL: NO 

^ (iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..958 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 1,-955 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

ATG GTG GGG AAC TGG GCT AAG GTT TTG GTT GTG ATG CTA CTC TTT GCC 48 

Met Val Gly Asn Tro Ala Lys Val Leu Val Val Met Leu Leu Phe Ala 

1 5 10 15 

GGC GTC GAC GGG CAT ACC CGC GTG TCA GGA GGG GCA GCA GCC TCC GAT 96 

Gly Val Asp Gly His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp 

20 25 30 

ACC AGG GGC CTT GTG TCC CTC TTT AGC CCC GGG-TCG GCT CAG AAA AT C 14 4 

Thr Arg Gly Leu Val Ser Leu Phe Ser Pro Gly Ser Ala Gin Lys He 

35 40 45 

CAG CTC GTA AAC ACC AAC GGC AGT TGG CAC ATC AAC AGG ACT GCC CTG 192 





Gin Leu Val Asn Thr Asn Gly Ser Trp His He Asn Arg Thr Ala Leu 
50 55 60 

AAC TGC AAC GAC TCC CTC CAA ACA GGG TTC TTT GCC GCA CTA TTC TAC 240 

Asn Cys Asn Asp Ser Leu Gin Thr Gly Phe ?he Ala Ala Leu Phe Tyr 
65 70 75 80 



AAA CAC AAA TTC AAC TCG TCT GGA TGC CCA GAG CGC TTG GCC AGC TGT 
Lys His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys 
85 90 95 



288 



CGC TCC ATC GAC AAG TTC GCT CAG GGG TGG GGT CCC CTC ACT TAC ACT 33 6 

Arg Ser He Asp Lys Phe Ala Gin Gly Trp Gly Pro Leu Thr Tyr Thr 
100 105 110 

GAG CCT AAC AGC TCG GAC CAG AGG CCC TAC TGC TGG CAC TAC GCG CCT 38 4 

Glu Pro Asn Ser Ser Aso Gin Arg Pro Tyr Cys Trp His Tyr Ala Pro 
115 120 125 

Si CGA CCG TGT GGT ATT GTA CCC GCG TCT CAG GTG TGC GGT CCA GTG TAT 432 

E?Arg Pro Cys Gly He Val Pro Ala Ser Gin Val Cys Gly Pro Val Tyr 
.7: 130 135 140 

= ! TGC TTC ACC CCG AGC CCT GTT GTG GTG GGG ACG ACC GAT CGG TTT GGT 480 

^ j Cys Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly 
CU45 . 150 155 160 

T'gTC CCC ACG TAT AAC TGG GGG GCG AAC GAC TCG GAT GTG CTG ATT CTC 528 
L.Val Pro Thr Tyr Asn Trp Gly Ala Asn Aso Ser As? Val Leu He Leu 

165 170 175 

©AAC AAC ACG CGG CCG CCG CGA GGC AAC TGG TTC GGC TGT ACA TGG ATG 57 6 

ylAsn Asn Thr Arg Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met 

Q 180 185 190 

^AAT GGC ACT GGG TTC ACC AAG ACG TGT GGG GGC CCC CCG TGC AAC ATC 624 

Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn He 

195 200 205 

GGG GGG GCC GGC AAC AAC ACC TTG ACC TGC CCC ACT GAC TGT TTT CGG 672 
Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg 
' 210 215 220 

AAG CAC CCC GAG GCC ACC TAC GCC AGA TGC GGT TCT GGG CCC TGG CTG 720 
Lys His Pro Glu Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu 
225 230 235 240 

ACA CCT AGG TGT ATG GTT CAT TAC CCA TAT AGG CTC TGG CAC TAC CCC 7 68 

Thr Pro Arg Cys Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro 
245 250 255 

TGC ACT GTC AAC TTC ACC ATC TTC AAG GTT AGG ATG TAC GTG GGG GGC 816 
Cys Thr Val Asn Phe Thr He Phe Lys Val Arg Met Tyr Val Gly Gly 
260 265 270 

GTG GAG CAC AGG TTC GAA GCC GCA TGC AAT TGG ACT CGA GGA GAG CGT 8 64 

Val Glu His Arg Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg 
275 280 285 

TGT GAC TTG GAG GAC AGG GAT AGA TCA GAG CTT AGC CCG CTG CTG CTG 912 
Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu 



961 



290 295 300 

TCT ACA ACA GGT GAT CGA GGG CAG ACA CCA TCA CCA CCA TCA CTA A 958 
Ser Thr Thr Gly Asp Arg Gly Gin Thr Pro Ser Pro Pro Ser Leu 
305 310 315 

TAG 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 319 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

Q (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

-Met Val Gly Asn Trp Ala Lys Val Leu Val Val Met Leu Leu Phe Ala 
^; 1 5 10 15 

%M31y Val Aso Gly His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp 
yd 20 25 30 

MThr Arg Gly Leu Val Ser Leu Phe Ser Pro Gly Ser Ala Gin Lys He 
™ 35 40 45 

Cbln Leu Val Asn Thr Asn Gly Ser Trp His He Asn Arg Thr Ala Leu 
SI 50 55 60 

Sftsn Cys Asn Asp Ser Leu Gin Thr Gly Phe Phe Ala Ala Leu Phe Tyr 
r ~: 65 70 75 80 

Hlys His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys 

85 90 95 

Arg Ser He Asp Lys Phe Ala Gin Gly Trp Gly Pro Leu Thr Tyr Thr 
100 105 110 

Glu Pro Asn Ser Ser Asp Gin Arg Pro Tyr Cys Trp His Tyr Ala Pro 
115 120 125 

Arg Pro Cys Gly He Val Pro Ala Ser Gin Val Cys Gly Pro Val Tyr 
130 135 140 

Cys Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly 
145 150 155 160 

Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp Ser Asp Val Leu He Leu 
165 170 175 

Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met 
180 185 190 

Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn He 
195 200 205 

Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg 

210 215 220 

Lys His Pro Glu Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu 





225 230 235 240 

Thr Pro Arg Cys Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro 
245 250 255 

Cys Thr Val Asn Phe Thr He Phe Lys Vai Arg Met Tyr Val Gly Gly 
260 265 270 

Val Glu His Arg Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg 
275 280 285 

Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu 
290 295 300 

Ser Thr Thr Gly Asp Arg Gly Gin Thr Pro Ser Pro Pro Ser Leu 
•305 310 315 

(2) INFORMATION FOR SEQ ID NO: 45: 

:i (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1395 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..1392 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 1..1389 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 



ATG 


GTG 


GCG 


GGG 


GCC 


CAT 


TGG 


GGA 


GTC 


CTG 


GCG 


GGC 


CTC 


GCC 


TAC 


TAT 


48 


Met 


Val 


Ala 


Gly 


Ala 


His 


Trp 


Gly 


Val 


Leu 


Ala 


Gly Leu Ala 


Tyr 


Tyr 




1 






5 










10 










15 






TCC 


ATG 


GTG 


GGG 


AAC 


TGG 


GCT 


AAG 


GTT 


TTG 


GTT 


GTG 


ATG 


CTA 


CTC 


TTT 


96 


Ser 


Met 


Val 


Gly 


Asn 


Trp 


Ala 


Lys 


Val 


Leu 


Val 


Val 


Met 


Leu 


Leu 


Phe 










20 








25 










30 








GCC 


GGC 


GTC 


GAC 


GGG 


CAT 


ACC 


CGC 


GTG 


TCA 


GGA 


GGG 


GCA 


GCA 


GCC 


TCC 


14 4 


Ala 


Gly Val 


Asp 


Gly 


His 


Thr 


Arg 


Val 


Ser 


Gly 


Gly Ala 


Ala 


Ala 


Ser 








35 








40 










45 










GAT 


ACC 


AGG 


GGC 


CTT 


GTG 


TCC 


CTC 


TTT 


AGC 


CCC 


GGG 


TCG 


GCT 


CAG 


AAA 


192 


Asp 


Thr Arg 


Gly 


Leu 


Val 


Ser 


Leu 


Phe 


Ser 


Pro 


Gly 


Ser 


Ala 


Gin 


Lys 




50 








55 










60 












ATC 


CAG 


CTC 


GTA 


AAC 


ACC 


AAC 


GGC 


AGT 


TGG 


CAC 


ATC 


AAC 


AGG 


ACT 


GCC 


240 


He 


Gin 


Leu 


Val 


Asn 


Thr 


Asn 


Gly 


Ser 


Trp 


His 


He 


Asn 


Arg Thr Ala 




65 










70 










75 










80 





CTG AAC 
Leu Asn 



TAC AAA 
Tyr Lys 

TGT CGC 
Cys Arg 



ACT GAG 
Thr Glu 
130 

pCCT CGA 
l&ro Arg 

"CtAT TGC 
CTyr Cys 



TGC AAC GAC 
Cys Asn Asp 
85 

CAC AAA TTC 
His Lys Phe 
100 

TCC ATC GAC 
Ser lie Asp 
115 

CCT AAC AGC 
Pro Asn Ser 



CCG TGT GGT 
Pro Cys Gly 



TTC ACC CCG 
Phe Thr Pro 
165 



TCC CTC 
Ser Leu 



AAC TCG 
Asn Ser 



AAG TTC 
Lys Phe 



TCG GAC 
Ser Asp 
135 

ATT GTA 
He Val 
150 

AGC CCT 
Ser Pro 



CAA ACA GGG TTC 
Gin Thr Gly Phe 
90 

TCT GGA TGC CCA 
Ser Gly Cys Pro 
105 

GCT CAG GGG TGG 
Ala Gin Gly Trp 
120 

CAG AGG CCC TAC 
Gin Arg Pro Tyr 



TTT GCC 
Phe Ala 



GAG CGC 
Glu Arg 



CCC GCG 
Pro Ala 



GTT GTG 
Val Val 



TCT CAG 
Ser Gin 
155 

GTG GGG 
Val Gly 
170 



GGT CCC 
Gly Pro 
125 

TGC TGG 
Cys Trp 
140 

GTG TGC 
Val Cys 



ACG ACC 
Thr Thr 



GCA CTA TTC 
Ala Leu Phe 
95 

TTG GCC AGC 
Leu Ala Ser 
110 

CTC ACT TAC 
Leu Thr Tyr 



CAC TAC GCG 
His Tyr Ala 



GGT CCA GTG 
Gly Pro Val 
160 

GAT CGG TTT 
Aso Arg Phe 
175 



rGGT GTC 
"fcly Val 



%CTC AAC 
IZheu Asn 



^ATG AAT 
HMet Asn 
210 

ATC GGG 

He Gly 
225 

CGG AAG 

Arg Lys 



CCC ACG TAT 
Pro Thr Tyr 
180 

AAC ACG CGG 
Asn Thr Arg 
195 

GGC ACT GGG 
Gly Thr Gly 



GGG GCC GGC 
Gly Ala Gly 



CAC CCC GAG 
His Pro Glu 
245 



AAC TGG 
Asn Trp 



CCG CCG 
Pro Pro 



TTC ACC 
Phe Thr 
215 

AAC AAC 
Asn Asn 
230 

GCC ACC 
Ala Thr 



GGG GCG 
Gly Ala 
185 

CGA GGC 
Arg Gly 
200 

AAG ACG 
Lys Thr 



AAC GAC 
Asn Asd 



AAC TGG 
Asn Trp 

TGT GGG 
Cys Gly 



ACC TTG ACC TGC 
Thr Leu Thr Cys 
235 

TAC GCC AGA TGC 
Tyr Ala Arg Cys 
250 



TCG GAT 
Ser Aso 



TTC GGC 
Phe Gly 
205 

GGC CCC 
Gly Pro 
220 

CCC ACT 
Pro Thr 



GGT TCT 
Gly Ser 



GTG CTG ATT 
Val Leu He 
190 

TGT ACA TGG 
Cys Thr Trp 



CCG TGC AAC 
Pro Cys Asn 



GAC TGT TTT 
Asp Cys Phe 
240 

GGG CCC TGG 
Gly Pro Trp 
255 



CTG ACA CCT AGG 
Leu Thr Pro Arg 
260 

CCC TGC ACT GTC 
Pro Cys Thr Val 
275 



TGT ATG GTT CAT 
Cys Met Val His 

AAC TTC ACC ATC 
Asn Phe Thr He 
280 



TAC CCA TAT AGG 
Tyr Pro' Tyr Arg 
265 

TTC AAG GTT AGG 
Phe Lys Val Arg 



CTC TGG CAC TAC 
Leu Trp His Tyr 
270 

ATG TAC GTG GGG 
Met Tyr Val Gly 
285 



GGC GTG GAG CAC AGG TTC GAA GCC GCA TGC AAT TGG ACT CGA GGA GAG 

Gly Val Glu His Arg Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu 

290 295 300 

CGT TGT GAC TTG GAG GAC AGG GAT AGA TCA GAG CTT AGC CCG CTG CTG 

Arg Cys Asd Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu 

305 * 310 315 320 



CTG TCT ACA ACA GAG TGG CAG ATA CTG CCC TGT TCC TTC ACC ACC CTG 



Leu 


Ser 


Thr 


Thr 


Glu 
325 


Trp 


Gin 


lie 


Leu 


Pro 
330 


Cys 


Ser 


Phe 


Thr 


Thr 
335 


Leu 




CCG 
Pro 


GCC 
Ala 


CTA 
Leu 


TCC 
Ser 
340 


ACC 
Thr 


GGC 
Gly 


CTG 
Leu 


ATC 
He 


CAC 
His 
345 


CTC 
Leu 


CAT 
His 


CAG 
Gin 


AAC 
Asn 


ATC 
He 
350 


GTG 
Val 


GAC 
Asp 


1UD O 


GTG 
Val 

\ 


CAA 
Gin 


TAC 
Tyr 
355 


CTG 
Leu 


TAC 
Tyr 


GGT 
Gly 


GTA 
Val 


GGG 
Gly 
360 


TCG 
Ser 


GCG 
Ala 


/"•mm 

GTT 
Val 


GTC 
Val 


TCC 
Ser 
365 


CTT 
Leu 


GTC 
Val 


Ile 




AAA 
Lys 


TGG 
Trp 
370 


GAG 
Glu 


TAT 
Tyr 


GTC 
Val 


CTG 
Leu 


T ru 
Leu 
375 


LIU 

Leu 


1 1 V- 

Phe 


Li 1 

Leu 


LIU 

Leu 


Liu 

Leu 
380 


Ala 


Asp 


Ala 


Arg 


J. J. -J 


ATC 
He 


TGC 
Cys 


GCC 
Ala 


TGC 
Cys 


TTA 
Leu 


TGG 
Trp 
390 


ATG 
Met 


ATG 
Met 


CTG 
Leu 


CTG 
Leu 


ATA 
He 
395 


GCT 
Ala 


CAA 
Gin 


GCT 
Ala 


GAG 
Glu 


GCC 
Ala 
400 






TTA 
Leu 


GAG 
Glu 


AAC 
Asn 


CTG 
Leu 
405 


GTG 
Val 


GTC 
Val 


CTC 
Leu 


AAT 
Asn 


GCG 
Ala 
410 


GCG 
Ala 


GCC 
Ala 


GTG 
Val 


GCC 
Ala 


GGG 
Gly 
415 


GCG 
Ala 


1 OylQ 

L4. H O 


2*is 

i—i 


GGC 
Gly 


ACT 
Thr 


CTT 
Leu 
420 


TCC 
Ser 


TTC 
Phe 


CTT 
Leu 


GTG 
Val 


TTC 
Phe 
425 


TTC 
Phe 


TGT 
Cys 


GCT 
Ala 


GCC 
Ala 


TGG 
Tro 
4 30 


TAC 
Tyr 


ATC 
He 


1 1 ("i fZ 

1296 


sAAG- 


GGC 
Gly 


AGG 
Arg 
435 


CTG 
Leu 


GTC 
Val 


CCT 
Pro 


GGT 
Gly 


GCG 
Ala 
440 


GCA 
Ala 


TAC 
Tyr 


GCC 
Ala 


TTC 
Phe 


TAT 
Tyr 
445 


GGC 
Gly 


GTG 
Val 


TGG 
Trp 


1344 


bCCG 

yi>ro 


CTG 
Leu 
450 


CTC 
Leu 


CTG 
Leu 


Leu 


CTG 
Leu 


CTG 
Leu 
455 


GCC 
Ala 


TTA 
Leu 


CCA 
Pro 


CCA 
Pro 


CGA 
Arg 
460 


GCT 
Ala 


TAT 
Tyr 


GCC 
Ala 


TAGTAA 


1395 



(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 63 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

Met Val Ala Gly Ala His Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr 
15 10 15 

Ser Met Val Gly Asn Trp Ala Lys Val Leu Val Val Met Leu Leu Phe 
20 25 30 

Ala Gly Val Asp Gly His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser 
35 40 45 

Asp Thr Arg Gly Leu Val Ser Leu Phe- Ser Pro Gly Ser Ala Gin Lys 
50 55 60 

He Gin Leu Val Asn Thr Asn Gly Ser Trp His He Asn Arg Thr Ala 
65 70 75 80 





Leu Asn Cys Asn Asp Ser Leu Gin Thr Gly Phe Phe Ala Ala Leu Phe 
85 90 95 

Tyr Lys Hi's Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser 
100 105 110 

Cys Arg Ser He Asp Lys Phe Ala Gin Gly Trp Gly Pro Leu Thr Tyr 
115 120 125 

Thr Glu Pro Asn Ser Ser Asp Gin Arg Pro Tyr Cys Trp His Tyr Ala 
130 135 140 

Pro Arg Pro Cys Gly He Val Pro Ala Ser Gin Val Cys Gly Pro Val 
145 150 155 160 

Tyr Cys Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe 
165 170 175 

4ily Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp Ser Asp Val Leu He 
^ 180 185 190 

djLeu Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp 
if! 195 200 205 

%et Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn 
^ 210 215 220 

W 

3 He Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe 
Q25 230 235 240 

-Arg Lys His Pro Glu Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp 
Si 245 250 255 

y i 

MLeu Thr Pro Arg Cys Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr 
M 260 265 270 

Pro Cys Thr Val Asn Phe Thr He Phe Lys Val Arg Met Tyr Val Giy 
275 280 285 

Gly Val Glu His Arg Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu 
290 295 300 

Arg Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu 
305 310 315 320 

Leu Ser Thr Thr Glu Trp Gin He Leu Pro Cys Ser Phe Thr Thr Leu 
325 330 335 

Pro Ala Leu Ser Thr Gly Leu He His Leu His Gin Asn He Val Asp 
340 345 350 

Val Gin Tyr Leu Tyr Gly Val Gly Ser Ala Val Val Ser Leu Val He 
355 360 365 

Lys Trp Glu Tyr Val Leu Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg 
370 375 380 

He Cys Ala Cys Leu Trp Met Met Leu Leu He Ala Gin Ala Glu Ala 
385 390 395 400 

Ala Leu Glu Asn Leu Val Val Leu Asn Ala Ala Ala Val Ala Gly Ala 



405 410 415 

His Gly Thr Leu Ser Phe Leu Val Phe Phe Cys Ala Ala Trp Tyr lie 
420 425 430 

Lys Gly Arg Leu Val Pro Gly Ala Ala Tyr Ala Phe Tyr Gly Val Trp 
435 440 445 

Pro Leu Leu Leu Leu Leu Leu Ala Leu Pro Pro Arg Ala Tyr Ala 
450 455 460 



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2082 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
«| (D) TOPOLOGY: linear 

B (ii) MOLECULE TYPE: cDNA 

fjj (iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 



^ (ix) FEATURE: 

D (A) NAME /KEY : CDS 

SA (B) LOCATION: 1. .2079 

p! . (ix) FEATURE: 

^! (A) NAME /KEY : mat_peptide 

ji; (B) LOCATION: 1..2076 

Ljs, 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

AAT TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC GGC TTC GCC GAC CTC 48 
Asn Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
15 10 15 

GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 96 
Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA 14 4 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG 192 
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 

CTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG 240 
Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

ICC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 288 
Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 
85 90 95 



TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 
Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 

GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 
Val Arc Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

CTC GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC 
Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr lie Arg Arg His 
130 135 140 

GTC GAT TTG CTC GTT GGG GCG GCT GCT TTC TGT TCC GCT ATG TAC GTG 
Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val 
145 150 155 160 

GGG GAC CTC TGC GGA TCT GTC TTC CTC GTC TCC CAG CTG TTC ACC ATC 
Glv Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr He 

Zi 165 170 175 

fi 

f-TCG CCT CGC CGG CAT GAG ACG GTG CAG GAC TGC AAT TGC TCA ATC TAT 

='ser Pro Arg Arg His Glu Thr Val Gin Asp Cys Asn Cys Ser He Tyr 
: 180 185 



336 



384 



432 



480 



528 



576 



190 



tfccC GGC CAC ATA ACG GGT CAC CGT ATG GCT TGG GAT ATG ATG ATG AAC 
-\?ro Gly His lie Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn 

u! 195 200 205 

=.TGG TCG CCT ACA ACG GCC CTG GTG GTA TCG CAG CTG CTC CGG ATC CCA 
=l Trp Se*- Pro Thr Thr Ala Leu Val Val Ser Gin Leu Leu Arg He Pro 
y 210 215 220 

_! 

?;CAA GCT GTC GTG GAC ATG GTG GCG GGG GCC CAT TGG GGA GTC CTG GCG 

-iGln Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val Leu Ala 

*225 230 235 240 

GGC CTC GCC TAC TAT TCC ATG GTG GGG AAC TGG GCT AAG GTT TTG GTT 
Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val Leu Val 
245 250 255 

GTG ATG CTA CTC TTT GCC GGC GTC GAC GGG CAT ACC CGC GTG TCA GGA 
Val Met Leu Leu Phe Ala Gly Val Asp Gly His Thr Arg Val Ser Gly 
260 265 270 

GGG GCA GCA GCC TCC GAT ACC AGG GGC CTT GTG TCC CTC TTT AGC CCC 
Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val Ser Leu Phe Ser Pro 
275 280 285 

GGG TCG GCT CAG AAA ATC CAG CTC GTA AAC ACC AAC GGC AGT TGG CAC 
Gly Ser Ala Gin Lys He Gin Leu Val Asn Thr Asn Gly Ser Trp His 
290 295 300 

ATC AAC AGG ACT GCC CTG AAC TGC AAC GAC TCC CTC CAA ACA GGG TTC 
He Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Gin Thr Gly Phe 
305 310 315 320 

TTT GCC GCA CTA TTC TAC AAA CAC AAA TTC AAC TCG TCT GGA TGC CCA 
Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn Ser Ser Gly Cys Pro 
325 330 335 



624 



67 2 



720 



763 



816 



864 



912 



960 



1008 



GAG CGC TTG GCC AGC TGT CGC TCC ATC GAC AAG TTC GCT CAG GGG TGG 



1056 







Glu 


Arq 


Leu 


Ala 


Ser 


Cys 


Arg 


Ser 


He 


Asp 


Lys 


Phe 


Ala 


Gin 


Gly 


Trp 










340 








345 










350 










GGT 


CCC 


CTC 


ACT 


TAC 


ACT 


GAG 


CCT 


AAC 


AGC 


TCb 


bAG 


GAG 


Abb 


GGG 


1 AG 


1 1 DA 
11111 




Glv 


Pro 


Leu 


Thr 


Tyr 


Thr 


Glu 


Pro 


Asn 


Ser 


Ser 


Asp 


Gin 


Arg 


Pro 


Tyr 








355 








360 










365 












TGC 


TGG 


CAC 


TAC 


GCG 


CCT 


CGA 


CCG 


TGT 


GGT 


ATT 


GTA 


r*> r*> 

CCC 


GCG 


TCT 


CAG 


1 1 CO 




Cys 


Tro 


His 


Tvr 


Ala 


Pro 


Arg 


Pro 


Cys 


Gly 


He 


Val 


Pro 


Ala 


Ser 


Gin 






370 










375 










380 














GTG 


TGC 


GGT 


CCA 


GTG 


TAT 


TGC 


TTC 


ACC 


CCG 


AGC 


CCT 


GTT 


GTG 


GTG 


GGG 


1200 




Val 


Cys 


Glv 


Pro 


Val 


Tyr 


Cys 


Phe 


Thr 


Pro 


Ser 


Pro 


Val 


Val 


Val 


Gly 






385 






390 










395 










400 






ACG 


ACC 


GAT 


CGG 


TTT 


GGT 


GTC 


CCC 


ACG 


TAT 


AAC 


TGG 


GGG 


GCG 


AAC 


GAC 


1248 




t! Thr 


Thr 


Asp 


Arg 


Phe 


Gly 


Val 


Pro 


Thr 


Tyr 


Asn 


Trp 


Gly 


Ala 


Asn 


Asp 










405 








410 










415 






IS™ 


] TCG 


GAT 


GTG 


CTG 


ATT 


CTC 


AAC 


AAC 


ACG 


CGG 


CCG 


CCG 


CGA 


c 

GGC 


7\ TV /-< 

AAC 


TGG 


LCv 0 




iSer 


Asp 


Val 


Leu 


He 


Leu 


Asn 


Asn 


Thr 


Arg 


Pro 


Pro 


Arg 


Gly 


Asn 


Trp 






i 




420 










425 










430 










|ttc 


GGC 


TGT 


ACA 


TGG 


ATG 


AAT 


GGC 


ACT 


GGG 


TTC 


ACC 


AAG 


ACG 


IG i 


GGG 


1 7 A A 
1 4 




IPhe 


Glv 


Cys 


Thr 


Trp 


Met 


Asn 


Gly 


Thr 


Gly 


Phe 


Thr 


Lys 


Thr 


Cys 


Gly 




s ; 
5 s I 


435 










440 










445 












GGC 


CCC 


CCG 


TGC 


AAC 


ATC 


GGG 


GGG 


GCC 


GGC 


AAC 


AAC 


ACC 


1 lb 


AGG 


1GG 


1 J7i 




IGly 


Pro 


Pro 


Cys 


Asn 


He 


Gly 


Gly 


Ala 


Gly 


Asn 


Asn 


Thr 


Leu 


Thr 


Cys 




>l i 


450 








455 










460 












ill! 


ccc 


ACT 


GAC 


TGT 


TTT 


CGG 


AAG 


CAC 


CCC 


GAG 


GCC 


ACC 


TAC 


GCC 


AGA 


TGC 


1440 


!~i i 


Pro 


Thr 


Asp 


Cys 


Phe 


Arg 


Lys 


His 


Pro 


Glu 


Ala 


Thr 


Tyr 


Ala 


Arg 


Cys 




a { j 

S I 


465 






470 










475 










480 






GGT 


TCT 


GGG 


CCC 


TGG 


CTG 


ACA 


CCT 


AGG 


TGT 


ATG 


GTT 


CAT 


TAC 


CCA 


TAT 


14o o 




Gly 


Ser 


Gly 

W J. J 


Pro 


Trp 


Leu 


Thr 


Pro 


Arg 


Cys 


Met 


Val 


His 


Tyr 


Pro 


Tyr 










485 










4 90 










495 








AGG 


CTC 


TGG 


CAC 


TAC 


CCC 


TGC 


ACT 


GTC 


AAC 


TTC 


ACC 


ATC 


TTC 


AAG 


GTT 


1 D Jb 




Arg 


Leu 


Trp 


His 


Tyr 


Pro 


Cys 


Thr 


Val 


Asn 


Phe 


Thr 


He 


Phe 


Lys 


Val 








500 








505 










510 










AGG 


ATG 


TAC 


GTG 


GGG 


GGC 


GTG 


GAG 


CAC 


AGG 


TTC 


GAA 


GCC 


GCA 


TGG 


AA1 


10 0 4 




Arg 


Met 


Tyr 


Val 


Gly 


Gly 


Val 


Glu 


His 


Arg 


Phe 


Glu 


Ala 


Ala 


Cys 


Asn 








515 










520 










525 












TGG 


ACT 


CGA 


GGA 


GAG 


CGT 


TGT 


GAC 


TTG 


GAG 


GAC 


AGG 


GAT 


AGA 


TCA 


GAb 


10 J£ 




Trp 


Thr 


Arq 


Gly 


Glu 


Arg 


Cys 


Asp 


Leu 


Glu 


Asp 


Arg 


Asp 


Arg 


Ser 


Glu 






530 








535 










540 














CTT 


AGC 


CCG 


CTG 


CTG 


CTG 


TCT 


ACA 


ACA 


GAG 


TGG 


CAG 


ATA 


CTG 


GGG 


lb 1 


lOOU 




Leu 


Ser 


Pro 


Leu 


Leu 


Leu 


Ser 


Thr 


Thr 


Glu 


Trp 


Gin 


He 


Leu 


Pro 


Cys 






545 










550 










555 










560 






TCC 


TTC 


ACC 


ACC 


CTG 


CCG 


GCC 


CTA 


TCC 


ACC 


GGC 


CTG 


ATC 


CAC 


CTC 


CAT 


1728 




Ser 


Phe 


Thr 


Thr 


Leu 


Pro 


Ala 


Leu 


Ser 


Thr 


Gly 


Leu 


He 


His 


Leu 


His 














565 










570 










575 








CAG 


AAC 


ATC 


GTG 


GAC 


GTG 


CAA 


TAC 


CTG 


TAC 


GGT 


GTA 


GGG 


TCG 


GCG 


GTT 


1776 




Gin 


Asn 


He 


Val 


Asp 


Val 


Gin 


Tyr 


Leu 


Tyr 


Gly 


Val 


Gly 


Ser 


Ala 


Val 





580 



585 



590 




GTC 


TCC 


CTT 


GTC 


ATC 


AAA 


TGG 


GAG 


TAT 


GTC 


CTG 


TTG 


CTC 


TTC 


CTT 


CTC 


1824 


Val 


Ser 


Leu 


Val 


He 


Lys 


Trp 


Glu 


Tyr 


Val 


Leu 


L&u 


Leu 


Phe 


Leu 


Leu 








595 










600 










605 










CTG 


GCA 


GAC 


GCG 


CGC 


ATC 


TGC 


GCC 


TGC 


TTA 


TGG 


ATG 


ATG 


CTG 


CTG 


ATA 


1872 


Leu 


Ala 


Asp 


Ala 


Arg 


He 


Cys 


Ala 


Cys 


Leu 


Trp Met 


Met 


Leu 


Leu 


He 






610 










615 










620 












GCT 


CAA 


GCT 


GAG 


GCC 


GCC 


TTA 


GAG 


AAC 


CTG 


GTG 


GTC 


CTC 


AAT 


GCG 


GCG 


1920 


Ala 


Gin 


Ala 


Glu 


Ala 


Ala 


Leu 


Glu 


Asn 


Leu 


Val 


Val 


Leu 


Asn 


Ala 


Ala 




625 










630 










635 










ca n 
o4 U 




GCC 


GTG 


GCC 


GGG 


GCG 


CAT 


GGC 


ACT 


CTT 


TCC 


TTC 


CTT 


GTG 


TTC 


TTC 


TGT 


1968 


Ala 


Val 


Ala 


Gly 


Ala 


His 


Gly Thr 


Leu 


Ser 


Phe 


Leu 


Val 


Phe 


Phe 


Cys 










645 










650 










655 






GCT 


GCC 


TGG 


TAC 


ATC 


AAG 


GGC 


AGG 


CTG 


GTC 


CCT 


GGT 


GCG 


GCA 


TAC 


GCC 


2016 


Ala 


Ala 


Trp 


Tyr 


He 


Lys 


Gly Arg 


Leu 


Val 


Pro 


Gly Ala 


Ala 


Tyr 


Ala 




t. 




660 










665 










670 








=TTC 


TAT 


GGC 


GTG 


TGG 


CCG 


CTG 


CTC 


CTG 


CTT 


CTG 


CTG 


GCC 


TTA 


CCA 


CCA 


2064 




Tyr 


Gly 


Val 


Trp 


Pro 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Ala 


Leu 


Pro 


Pro 






675 










680 










685 











ItGA GCT TAT GCC TAGTAA 2082 
jArg Ala Tyr Ala 
690 



2;j(2) INFORMATION FOR SEQ ID NO: 48: 

fji (i) SEQUENCE CHARACTERISTICS: 
p (A) LENGTH: 692 amino acids 

i_ s . (B) TYPE: amino acid 

r " (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

Asn Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu i 
*1 5 10 15 

Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 

Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 
85 90 95 



Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 



Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Vai Ala Leu Thr Pro Thr 
115 120 125 

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His 
130 135 140 

Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val 
145 150 155 160 

Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr He 
165 170 175 

Ser Pro Arg Arg His Glu Thr Val Gin Asp Cys Asn Cys Ser He Tyr 
180 185 190 

Pro Gly His He Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn 
^ 195 200 205 

CtTrp Ser Pro Thr Thr Ala Leu Val Val Ser Gin Leu Leu Arg He Pro 
Hi 210 215 220 

%ln Ala Val Val Asd Met Val Ala Gly Ala His Trp Gly Val Leu Ala 
^25 ^ 230 235 240 

&ly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val Leu Val 
Ul 245 * 250 255 

rt/al Met Leu Leu Phe Ala Gly Val Asp Gly His Thr Arg Val Ser Gly 
~! 260 265 270 

Cbly Ala Ala Ala Ser Asd Thr Arg Gly Leu Vai Ser Leu Phe Ser Pro 
yl 275 ' 280 285 

UGly Ser Ala Gin Lys He Gin Leu Val Asn Thr Asn Gly Ser Trp His 
290 295 300 

He Asn Arg Thr Ala Leu Asn Cys Asn Aso Ser Leu Gin Thr Gly Phe 
305 310 315 320 

Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn Ser Ser Gly Cys Pro 
325 330 335 

Glu Arg Leu Ala Ser Cys Arg Ser lie Asp Lys Phe Ala Gin Gly Trp 
340 345 350 

Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser Asp Gin Arg Pro Tyr 
355 360 365 

Cys Trp His Tyr Ala Pro Arg Pro Cys Gly He Val Pro Ala Ser Gin 
370 375 380 

Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val Val Val Gly 
385 390 395 400 

Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp 
405 410 415 



Ser Asp Val Leu He Leu Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp 
420 425 430 

Phe Gly Cys Thr Trp Met Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly 





435 440 445 

Gly Pro Pro Cys Asn He Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys 
450 455 460 

Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr Tyr Ala Arg Cys 
465 470 475 480 

Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met Val His Tyr Pro Tyr 
485 490 495 

Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr lie Phe Lys Val 
500 505 510 

Arg Met Tyr Val Gly Gly Val Glu His Arg Phe Glu Ala Ala Cys Asn 
515 520 525 

„Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu 
P 530 535 540 

ffiLeu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp Gin He Leu Pro Cys 
{Cj 54 5 550 555 560 

Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly Leu lie His Leu His 
565 570 575 

WGln Asn He Val Aso Val Gin Tyr Leu Tyr Gly Val Gly Ser Ala Val 
g 580 * 585 590 

Ci Val Ser Leu Val lie Lys Trp Glu Tyr Val Leu Leu Leu Phe Leu Leu 
595 600 605 

3 i 

Oj Leu Ala Asp Ala Arg He Cys Ala Cys Leu Trp Met Met Leu Leu He 
610 615 620 

Ala Gin Ala Glu Ala Ala Leu Glu Asn Leu Val Val Leu Asn Ala Ala 
625 630 635 640 

Ala Val Ala Gly Ala His Gly Thr Leu Ser Phe Leu Val Phe Phe Cys 
645 650 655 

Ala Ala Trp Tyr He Lys Gly Arg Leu Val Pro Gly Ala Ala Tyr Ala 
660 665 670 

Phe Tyr Gly Val Trp Pro Leu Leu Leu Leu Leu Leu Ala Leu Pro Pro 
675 680 685 

Arg Ala Tyr Ala 
690 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2433 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



utvcmv-T 



(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..2430 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 1..2427 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

ATG AGC ACG AAT CCT AAA CCT CAA AGA AAA ACC AAA CGT AAC ACC AAC 4 8 

Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 
n l 5 J> 10 15 

fTiCGC CGC CCA CAG GAC GTC AAG TTC CCG GGC GGT GGT CAG ATC GTT GGT 96 
"^Arg Arg Pro Gin Asp Val Lys Phe Pro Giy Giy Gly Gin He Val Gly 
^ 20 25 30 

WgGA GTT TAC CTG TTG CCG CGC -AGG GGC CCC AGG TTG GGT GTG CGC GCG 14 4 

GiGly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 
u\ 35 40 45 

L ; ACT AGG AAG ACT TCC GAG CGG TCG CAA CCT CGT GGG AGG CGA CAA CCT 192 
^ Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 
N 50 55 60 

ff\ ATC CCC AAG GCT CGC CGA CCC GAG GGT AGG GCC TGG GCT CAG CCC GGG 24 0 

h He Pro Lys Ala Arg Arg Pro Glu Gly Arg Ala Trp Ala Gin Pro Gly 
C! 65 70 75 80 

TAC CCT TGG CCC CTC TAT GGC AAT GAG GGC ATG GGG TGG GCA GGA TGG 288 
Tyr Pro Trp Pro Leu Tyr Giy Asn Glu Gly Met Gly Tr? Ala Gly Trp 
85 90 95 

CTC CTG TCA CCC CGC GGC TCT CGG CCT AGT TGG GGC CCT ACA GAC CCC 336 
Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp ^Gly Pro Thr Asp Pro 
100 105 110 

CGG CGT AGG TCG CGT AAT TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC 384 
Arg Arg Arg Ser Arg Asn Leu Gly Lys Val He Asp Thr Leu Thr Cys 
115 120 125 

GGC TTC GCC GAC CTC GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA 4 32 

Gly Phe Ala Asp Leu Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu 
130 135 140 

GGG GGC GCT GCC AGG GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC 4 80 

Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 
145 150 155 160 

GGC GTG AAC TAT GCA ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC 528 
Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He 
165 170 175 

TTC CTC TTG GCT TTG CTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT 576 
Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr 



180 



185 



190 



GAA 
Glu 



AAC 
Asn 



GGG 
Gly 
225 

GCG 
Ala 



GTG CGC AAC GTG 
Val Arg Asn Val 
195 

TCA AGC ATT GTG 
Ser Ser He Val 
210 

TGC GTG CCC TGC 
Cys Val Pro Cys 



CTC ACC 
Leu Thr 



CCC ACG 
Pro Thr 
245 



TCC GGG ATG TAC CAT GTC ACG AAC 
Ser Gly Met Tyr His Val Thr Asn 
200 205 

TAT GAG GCA GCG GAC ATG ATC ATG 
Tyr Glu Ala Ala Asp Met He Met 
215 220 

GTT CGG GAG AAC AAC TCT TCC CGC 
Val Arg Glu Asn Asn Ser Ser Arg 
230 235 

CTC GCA GCT AGG AAC GCC AGC GTC 
Leu Ala Ala Arg Asn Ala Ser Val 
250 



GAC TGC TCC 
Asp Cys Ser 



CAC ACC CCC 
His Thr Pro 



TGC TGG GTA 
Cys Trp Val 
240 

CCC ACC ACG 
Pro Thr Thr 
255 



624 



672 



720 



768 



OACA 
jiThr 

II 

=! TCC 
= j Ser 



CAG 

:Gln 



] AAT 
iAsn 
J 305 

GAT 
Asp 



CTG 
Leu 



TGG 
Trp 



GCT 
Ala 



ACC 
Thr 
385 

TCC 
Ser 



AAC 
Asn 



ATA CGA 
lie Arg 



GCT ATG 
Ala Met 
275 

CTG TTC 
Leu Phe 
290 

TGC TCA 
Cys Ser 



CGC CAC 
Arg His 
260 

TAC GTG 
Tyr Val 



ACC ATC 
Thr He 



ATC TAT 
He Tyr 



GTC GAT 
Val Asp 



GGG GAC 
Gly Asp 



ATG ATG ATG AAC 
Met Met Met Asn 
325 

CTC CGG ATC CCA 
Leu Arg He Pro 
340 

GGA GTC CTG GCG 
Gly Val Leu Ala 
355 

AAG GTT TTG GTT 
Lys Val Leu Val 
370 

CGC GTG TCA GGA 
Arg Val Ser Gly 



CTC TTT AGC CCC 
Leu Phe Ser Pro 
405 

GGC AGT TGG CAC 
Gly Ser Trp His 
420 



TCG CCT 
Ser Pro 
295 

CCC GGC 
Pro Gly 
310 

TGG TCG 
Trp Ser 



CAA GCT 
Gin Ala 



GGC CTC 
Gly Leu 



GTG ATG 
Val Met 
375 

GGG GCA 
Gly Ala 
390 

GGG TCG 
Gly Ser 



ATC AAC 
He Asn 



TTG CTC 
Leu Leu 
265 

CTC TGC 
Leu Cys 
280 

CGC CGG 
Arg Arg 



GTT GGG GCG GCT 
Val Gly Ala Ala 



CAC ATA 
His He 



CCT ACA 
Pro Thr 



GTC GTG 
Val Val 
345 

GCC TAC 
Ala Tyr 
360 

CTA CTC 
Leu Leu 



GGA TCT 
Gly Ser 



CAT GAG 
His Glu 



ACG GGT 
Thr Gly 
315 

ACG GCC 
Thr Ala 
330 

GAC ATG 
Asd Met 



TAT TCC 
Tyr Ser 



TTT GCC 
Phe Ala 



GTC TTC 
Val Phe 
285 

ACG GTG 
Thr Val 
300 

CAC CGT 
His Arg 



GCA GCC 
Ala Ala 



GCT CAG 
Ala Gin 



AGG ACT 
Arg Thr 
425 



TCC GAT 
Ser Asp 
395 

AAA ATC 
Lys He 
410 

GCC CTG 
Ala Leu 



CTG GTG 
Leu Val 



GTG GCG 
Val Ala 



ATG GTG 
Met Val 
365 

GGC GTC 
Gly Val 
380 

ACC AGG 
Thr Arg 



CAG CTC 
Gin Leu 



AAC TGC 
Asn Cys 



GCT TTC TGT 
Ala Phe Cys 
270 

CTC GTC TCC 
Leu Val Ser 



CAG GAC TGC 
Gin Asp Cys 



ATG GCT TGG 
Met Ala Trp 
• 320 

GTA TCG CAG 
Val Ser Gin 
335 

GGG GCC CAT 
Gly Ala His 
350 

GGG AAC TGG 
Gly Asn Trp 



GAC GGG CAT 
Asp Gly His 



GGC CTT GTG 
Gly Leu Val 
400. 

GTA AAC ACC 
Val Asn Thr 
415 

AAC GAC TCC 
Asn Asp Ser 
430 



816 



864 



912 



960 



1008 



1056 



1104 



1152 



1200 



1248 



1296 





CTC 
Leu 


CAA 
Gin 


ACA GGG 
Thr Gly 
435 


TTC 
Phe 


TTT 
Phe 


GCC 
Ala 


GCA 
Ala 
440 


CTA 
Leu 


TTC 
Phe 


TAC 
Tyr 


AAA 
Lys 


CAC 
His 
445 


AAA 
Lys 


TTC 
Phe 


AAC 
Asn 


1344 


TCG 
Ser 


TCT 
Ser 
450 


GGA TGC 
Gly Cys 


CCA 
Pro 


GAG 
Glu 


CGC 
Arg 
455 


TTG 
Leu 


GCC 
Ala 


AGC 
Ser 


TGT 
Cys 


CGC 
Arg 
460 


TCC 
Ser 


ATC 
He 


GAC 
Asp 


AAG 
Lys 


1392 


TTC 
Phe 
465 


GCT 
Ala 


CAG GGG 
Gin Gly 


TGG 
Trp 


GGT 
Gly 
470 


CCC 
Pro 


CTC 
Leu 


ACT 
Thr 


TAC 
Tyr 


ACT 
Thr 
475 


GAG 
Glu 


CCT 
Pro 


AAC 
Asn 


AGC 
Ser 


TCG 
Ser 
480 


1440 


GAC 
Asp 


CAG 
Gin 


AGG 
Arg 


CCC 
Pro 


TAC 
Tyr 
485 


TGC 
Cys 


TGG 
Trp 


CAC 
His 


TAC 
Tyr 


GCG 
Ala 
4 90 


CCT CGA 
Pro Arg 


CCG 
Pro 


TGT 
Cys 


GGT 
Gly 
495 


ATT 
He 


1488 


¥gta 

tfjval 


CCC 
Pro 


GCG 
Ala 


TCT 
Ser 
500 


CAG 
Gin 


GTG 
Val 


TGC 
Cys 


GGT 
Gly 


CCA 
Pro 
505 


GTG 
Val 


TAT 
Tyr 


TGC 
Cys 


TTC 
Phe 


ACC 
Thr 
510 


CCG 
Pro 


AGC 
Ser 


1536 


iffCCT 
[7; Pro 


GTT 
Val 


GTG 
Val 
515 


GTG 
Val 


GGG 
Gly 


ACG 
Thr 


ACC 
Thr 


GAT 
Asp 
520 


CGG 
Arg 


TTT 
Phe 


GGT GTC 
Gly Val 


CCC 
Pro 
525 


ACG 
Thr 


TAT 
Tyr 


AAC 
Asn 


1584 


Wtgg 

s Trp 


GGG 
Glv 
530 


GCG 
Ala 


AAC 
Asn 


GAC 
Asp 


TCG 
Ser 


GAT 
Asp 
535 


GTG 
Val 


CTG 
Leu 


ATT 
lie 


CTC 
Leu 


AAC 
Asn 
540 


AAC 
Asn 


ACG 
Thr 


CGG 
Arg 


CCG 
Pro 


1632 


CCG 

Pro 

«; 545 


CGA 
Arg 


GGC AAC 
Gly Asn 


TGG 
Trp 


TTC 
Phe 
550 


GGC 
Gly 


TGT 
Cys 


ACA 
Thr 


TGG 
Tr? 


ATG 
Met 
555 


AAT 
Asn 


GGC 
Gly 


ACT GGG 
Thr Gly 


TTC 
Phe 
560 


1680 


N ; ACC 
Thr 


AAG 
Lys 


ACG 
Thr 


TGT 
Cys 


GGG 
Gly 
565 


GGC 
Gly 


CCC 
Pro 


CCG 
Pro 


TGC 
Cys 


AAC 
Asn 
570 


ATC 
lie 


GGG 
Gly 


GGG GCC 
Gly Ala 


GGC AAC 
Gly Asn 
575 


1728 


AAC 
Asn 


ACC 
Thr 


TTG 
Leu 


ACC 
Thr 
580 


TGC 
Cys 


CCC 
Pro 


ACT 
Thr 


GAC 
Asp 


TGT 
Cys 
585 


TTT 
Phe 


CGG 
Arg 


AAG 
Lys 


CAC 
His 


CCC 
Pro 
590 


GAG 
Glu 


GCC 
Ala 


1776 


ACC 
Thr 


TAC GCC AGA 
Tyr Ala Arg 
595 


TGC 
Cys 


GGT 
Gly 


TCT 
Ser 


GGG 
Gly 
600 


CCC 
Pro 


TGG 
Trp 


CTG 
Leu 


ACA 
Thr 


CCT AGG TGT 
Pro Arg Cys 
605 


ATG 
Met 


1824 


GTT 
Val 


CAT 
His 
610 


TAC 
Tyr 


CCA 
Pro 


TAT 
Tyr 


AGG 
Arg 


CTC 
Leu 
615 


TGG 
Trp 


CAC 
His 


TAC 
Tyr 


CCC 
Pro 


TGC 
Cys 
620 


ACT 
Thr 


GTC 
Val 


AAC 
Asn 


TTC 
Phe 


1872 


ACC 
Thr 
625 


ATC 
He 


TTC 
Phe 


AAG 
Lys 


GTT 
Val 


AGG 
Arg 
630 


ATG 
Met 


TAC 
Tyr 


GTG GGG GGC GTG 
Val Gly Gly Val 
635 


GAG 
Glu 


CAC 
His 


AGG 
Arg 


TTC 
Phe 
640 


1920 


GAA 
Glu 


GCC 
Ala 


GCA 
Ala 


TGC 
Cys 


AAT 
Asn 
645 


TGG 
Trp 


ACT 
Thr 


CGA 
Arg 


GGA GAG 
Gly Glu 
650 


CGT TGT 
Arg Cys 


GAC 
Asp 


TTG 
Leu 


GAG GAC 
Glu Asp 
655 


1968 


AGG 
Arg 


GAT AGA 
Asp Arg 


TCA 
Ser 
660 


GAG 
Glu 


CTT 
Leu 


AGC 
Ser 


CCG 
Pro 


CTG 
Leu 
665 


CTG 
Leu 


CTG 
Leu 


TCT 
Ser 


ACA 
Thr 


ACA 
Thr 
670 


GAG 
Glu 


TGG 
Trp 


2016 



CAG 


ATA 


GIG 


ppp 


TPT 


1 GG 


ttp 
1 IG 


AGG 




PTC 
Liu 


CCG 


GCC CTA 


TCC 


ACC 


GGC 


2064 


Gin 


He 


Leu 


Pro 


Cys 


Ser 


Phe 


Thr 


Thr 


Leu 


Pro 


Ala Leu 


Ser 


Thr 


Gly 








675 








680 








685 










CTG 


ATC 


CAC 


u it- 


p n t 


GAG 


a zip 




PTP. 


P.AP 


GTG 


CAA TAC 


CTG 


TAC 


GGT 


2112 


Leu 


lie 


His 


Leu 


His 


Gin 


Asn 


lie 


Val 


Asp 


Val Gin Tyr Leu Tyr Gly 






690 










695 










700 










GTA 


GGG 


TCG 


GCG 


GTT 


G iG 


I GG 


PTT 
LI 1 


GIL 


rtp 


AAA 


TGG GAG 


TAT 


GTC 


CTG 


9i fin 


Val 


Gly 


Ser 


Ala 


Val 


Val 


Ser 


Leu 


Val 


He 


Lys 


Trp Glu Tyr Val 


Leu 




705 








710 










715 








Tin 

720 




TTG 


CTC 


TTC 


CTT 


CTC 


CTG 


GCA 


GAC 


GGG 


ppp 

GGG 


ATC 


TGC GCC 


TGC 


TTA 


TGG 


Z ZU 0 


Leu 


Leu 


Phe 


Leu 


Leu 


Leu 


Ala 


Asp 


Ala 


Arg 


He 


Cys Ala 


Cys 


Leu 


Trp 












725 










730 








735 






ATG 


ATG 


CTG 


CTG 


ATA 


GG 1 


GAA 


PPT 

GG i 


pap 
GAG 


ppp 

GGG 


GCC 


TTA GAG 


AAC 


CTG 


GTG 


9 9 5 fi 

Z Z D 0 


J$et 


Met 


Leu 


Leu 


He 


Ala 


Gin 


Ala 


Glu 


Ala 


Ala 


Leu Glu 


Asn 


Leu 


Val 










740 










745 








750 










CTG 




PPP 

GGG 


ppp 
bib 






ppp 


ppp 


ppp 


CAT 


GGC ACT 


CTT 


TCC 


TTC 


9104 




Leu 


Asn 


Ala 


Ala 


Ala 


Val 


Ala 


Gly 


Ala 


His 


Gly Thr 


Leu 


Ser 


Phe 








755 










760 








765 












GTG 


TTC 


TTC 


TGT 


GCT 


GCC 


TGG 


TAC 


ATC 


AAG 


GGC AGG 


CTG 


GTC 


CCT 


2352 


H;eu 


Val 


Phe 


Phe 


Cys 


Ala 


Ala 


Trp 


Tyr 


He 


Lys 


Gly Arg 


Leu 


Val 


Pro 




yy 


770 








775 










780 










5 

[3GT 


GCG 


GCA 


TAC 


GCC 


TTC 


TAT 


GGC 


GTG 


TGG 


CCG 


CTG CTC 


CTG 


CTT 


CTG 


2400 


=~Gly 


Ala 


Ala 


Tyr 


Ala 


Phe 


Tyr 


Gly 


Val 


Trp 


Pro 


Leu Leu 


Leu 


Leu 


Leu 




485 








790 










795 








800 




S TG 


GCC 


TTA 


CCA 


CCA 


CGA 


GCT 


TAT 


GCC 


TAGTAA 










2433 


^eu 


Ala 


Leu 


Pro 


Pro 


Arg 


Ala 


Tyr 


Ala 
























805 










810 















(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 809 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 
1 5 10 15 

Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin He Val Gly 
20 25 30 

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 
35 40 45 

Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 
50 55 60 

He Pro Lys Ala Arg Arg Pro Glu Gly Arg Ala Trp Ala Gin Pro Gly 
65 70 75 80 



Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Met Gly Trp Ala Gly Trp 
85 90 95 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro 
100 105 110 

Arg Arg Arg Ser Arg Asn Leu Gly Lys Val He Asp Thr Leu Thr Cys 
115 120 125 

Gly Phe Ala Asp Leu Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu 
130 135 140 

Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 
145 150 155 160 

Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He 
_ 165 170 175 

Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr 
EE 180 185 190 

Glu Val Arg Asn Val Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser 
^ 195 ' 200 205 

W 

& Asn Ser Ser He Val Tyr Glu Ala Ala Asp Met He Met His Thr Pro 
Uj 210 215 220 

^< Gly Cys Val Pro Cys Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val 
^ 225 230 235 240 

"SI 

S Ala Leu Thr Pro Thr Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr 
y! 245 250 255 

y'Thr He Arg Arg His Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys 
260 265 270 

Ser Ala Met Tyr Val Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser 
275 280 285 

Gin Leu Phe Thr He Ser Pro Arg Arg His Glu Thr Val Gin Asp Cys 
■ 290 295 300 

Asn Cys Ser He Tyr Pro Gly His He Thr Gly His Arg Met Ala Trp 
305 310 315 320 

Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val Val Ser Gin 
325 330 335 

Leu Leu Arg He Pro Gin Ala Val Val Asp Met Val Ala Gly Ala His 
340 345 350 

Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp 
355 360 365 

Ala Lys Val Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly His 
370 375 380 

Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val 
385 390 395 400 





Ser Leu Phe Ser Pro Gly Ser Ala Gin Lys He Gin Leu Val Asn Thr 
405 410 415 

Asn Gly Ser Trp His He Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser 
420 425 430 

Leu Gin Thr Gly Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn 
435 440 445 

Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser He Asp Lys 
450 455 460 

Phe Ala Gin Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser 
465 470 475 480 

Asd Gin Arg Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly He 
485 490 495 

?Val Pro Ala Ser Gin Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser 
^ 500 505 510 

*DPro Val Val Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn 
£l 515 520 525 

- Trp Gly Ala Asn Asp Ser Asp Val Leu He Leu Asn Asn Thr Arg Pro 
H ' 530 535 540 

= p^o Arg Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly Phe 
0545 550 555 560 

?==:Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn He Gly Gly Ala Gly Asn 
2; 565 570 575 

OAsn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala 
M> 580 585 590 

Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met 
595 600 605 

Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe 
610 615 620 

Thr He Phe Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Phe 
625 630 635 640 

Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp 
645 650 655 

Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp 
660 665 670 

Gin He Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly 
675 680 685 

Leu He His Leu His Gin Asn He Val Asp Val Gin Tyr Leu Tyr Gly 
690 695 700 

Val Gly Ser Ala Val Val Ser Leu Val He Lys Trp Glu Tyr Val Leu 
705 710 • 715 720 

Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg He Cys Ala Cys Leu Trp 



725 730 735 

Met Met Leu Leu lie Ala Gin Ala Glu Ala Ala Leu Glu Asn Leu Val 
740 745 750 

Val Leu Asn Ala Ala Ala Val Ala Gly Ala His Gly Thr Leu Ser Phe 
755 760 765 

Leu' Val Phe Phe Cys Ala Ala Trp Tyr He Lys Gly Arg Leu Val Pro 
770 775 780 

Gly Ala Ala Tyr Ala Phe Tyr Gly Val Trp Pro Leu Leu Leu Leu Leu 
785 790 795 800 

Leu Ala Leu Pro Pro Arg Ala Tyr Ala 
805 



^(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 
(3) LOCATION: 1 . . 17 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 



Se*- Asn Ser Ser Glu Ala Ala Aso Met He Met His Thr Pro Gly Cys 
15 10 15 



Val 



(2) INFORMATION FOR SEQ ID NO: 52: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 1..22 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Gly Gly He Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp 



HPA/FNY = T 



10 15 



Ser Pro Thr Thr Ala Leu 
20 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 1..37 



; 3 : 



Li.! 

3 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

Tyr Glu Val Arg Asn Val Ser Gly He Tyr His Val Thr Asn Asp Cys 
1 5 10 15 

Se*- Asn Ser Ser He Val Tvr Glu Ala Ala Asp Met He Met His Thr 
20 ~ 25 30 

Pro Gly Cys Gly Lys 
35 



yi(2) INFORMATION FOR SEQ ID NO: 54: 

U (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 1..25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Gly Gly Thr Pro Thr Val Ala Thr Arg Asp Gly Lys Leu Pro Ala Thr 
15 10 15 

Gin Leu Arg Arg His He Asp Leu Leu 
20 25 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 25 amino acids 
(3) TYPE: amino acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 1..25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Giy Gly Thr Pro Thr Leu Ala Ala Arg Asp Ala Ser Val Pro Thr Thr 
15 10 15 

Thr He Arg Arg His Val Asp Leu Leu 
20 25 

•,f\ 

^ (2) INFORMATION FOR SEQ ID NO: 56: 

€i (i) SEQUENCE CHARACTERISTICS: 

\0 (A) LENGTH: 20 amino acids 

hi (B) TYPE: amino acid 

fjjji (C) STRANDEDNESS: single 

{ D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Gin Val Arg Asn 
15 10 15 

Ser Thr Gly Leu 
20 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Gin Val Arg Asn Ser Thr Gly Leu Tyr His Val Thr Asn Asp Cys Pro 
1 5 10 . 15 

Asn Ser Ser He 
20 

(2) INFORMATION FOR SEQ ID NO: 58: 




(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

Asn Asp Cys Pro Asn Ser Ser He Val Tyr Glu Ala His Asp Ala He 
15 10 15 

Leu His Thr Pro 
20 

S{2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

Ser Asn Ser Ser He Val Tyr Glu Ala Ala Asp Met He Met His Thr 
1 5 10 15 

Pro Gly Cys Val 
20 

(2) INFORMATION FOR SEQ ID NO: 60: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

His Asp Ala He Leu His Thr Pro Gly Val Pro Cys Val Arg Glu Gly 
1 -5 10 15 

Asn Val Ser 



(2) INFORMATION FOR SEQ ID NO: 61: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

Cys Val Arg Glu Gly Asn Val Ser Arg Cys Trp Val Ala Met Thr Pro 
15 10 15 

Thr Val Ala Thr 
20 

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 
:; (A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

: 

.) 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

Ala Met Thr Pro Thr Val Ala Thr Arg Asp Gly Lys Leu Pro Ala Thr 
-j 1 5 10 15 

Gin Leu Arg Arg 

5i 20 

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

Leu Pro Ala Thr Gin Leu Arg Arg His He Asp Leu Leu Val Gly Ser 
15 10 15 



Ala Thr Leu Cys 
20 



(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 



wry t*z\*\fjr 



«• 90 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



7-5 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

Leu Val Gly Ser Ala Thr Leu Cys Ser Ala Leu Tyr Val Gly Asp Leu 
15 10 15 

Cys Gly Ser Val 
20 

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

3 (ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 65: 

Gin Leu Phe Thr Phe Ser Pro Arg Arg His Trp Thr Thr Gin Gly Cys 
15 10 15 

Asn Cys Ser lie 
20 



^ ! (2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

Thr Gin Gly Cys Asn Cys Ser He Tyr Pro Gly His He Thr Gly His 
15 10 15 

Arg Met Ala Trp 
20 

(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 



HCVENV-T 



(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

He Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp Ser Pro 
15 10 15 

Thr Ala Ala Leu 
20 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

I (ii) MOLECULE TYPE: peptide 



hj (xi)' SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

L Asn Trp Ser Pro Thr Ala Ala Leu Val Met Ala Gin Leu Leu Arg He 
U 1 5 10 15 

w 

Ci Pro Gin Ala He 

y ! 20 

"(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

Leu Leu Arq He Pro Gin Ala He Leu Asp Met He Ala Gly Ala His 
15 10 15 



Tro Gly Val Leu 
20 



(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 



Ala Gly Ala His Trp Gly Val Leu Ala Gly He Ala Tyr Phe Ser Met 
15 10 15 

Val Gly Asn Met 
20 



(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

I (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

j j (ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

Val Val Leu Leu Leu Phe Ala Gly Val Asp Ala Glu Thr He Val Ser 
U l 5 10 15 

^ i 

D Gly Gly Gin Ala 

![T(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

Ser Gly Leu Val Ser Leu Phe Thr Pro Gly Ala Lys Gin Asn He Gin 
15 10 15 

Leu He Asn Thr 
20 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : linear 
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2 I 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

Gin Asn lie Gin Leu lie Asn Thr Asn Gly Gin Trp His lie Asn Ser 
15 10 15 

Thr Ala Leu Asn 
20 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

O (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
ffi (ii) MOLECULE TYPE: peptide 

W (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

Leu Asn Cys Asn Glu Ser Leu Asn Thr Gly Trp Tro Leu Ala Gly Leu 
15 10 15 

lie Tyr Gin His Lys 

^ 20 

fjf{2) INFORMATION FOR SEQ ID NO: 75: 

yi (i) SEQUENCE CHARACTERISTICS: 

r ' (A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

. (ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

Ala Gly Leu He Tyr Gin His Lys Phe Asn Ser Ser Gly Cys Pro Glu 
15 10 15 

Arg Leu Ala Ser 
20 

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg Pro Leu Thr Asp Phe Asp 
15 10 15 

Gin Gly Trp Gly 
20 

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
3 (D) TOPOLOGY: linear 



ll! 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

Thr Asp Phe Asp Gin Gly Trp Gly Pro He Ser Tyr Ala Asn Gly Ser 
15 10 15 



"« 5 Gly Pro Asp Gin 

Ni 20 

C3 

CR2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 
5 ' (A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

Ala Asn Gly Ser Gly Pro Asp Gin Arg Pro Tyr Cys Trp His Tyr Pro 
15 10 15 

Pro Lys Pro Cys 
20 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 





(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 



Trp His Tyr Pro Pro Lys Pro Cys Gly lie Val Pro Ala Lys Ser Val 
1-5 10 15 

Cys Gly Pro Val 



(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

% (ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

Ala Lys Ser Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val 
15 10 15 

Val Val Gly Thr 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Ser Gly Ala Pro Thr 
15 10 15 

Tyr Ser Trp Gly 



(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



20 



20 



f^2) INFORMATION FOR SEQ ID NO: 81: 



20 



(ii) MOLECULE TYPE: peptide 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

Gly Ala Pro Thr Tyr Ser Trp Gly Glu Asn Asp Thr Asp Val Phe Val 
15 10 15 

Leu Asn Asn Thr 
20 

(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
{ D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



S3 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

~i Gly Asn Tro Phe Gly Cys Thr Trp Met Asn Ser Thr Gly Phe Thr Lys 

r=: 1 * 5 10 "15 

^ Val Cys Gly Ala 

W 20 

32) INFORMATION FOR SEQ ID NO: 84: 

»j (i) SEQUENCE CHARACTERISTICS: 

3; (A) LENGTH: 20 amino acids 

yj (B) TYPE: amino acid 

Q (C) STRANDEDNESS: single 

M< (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

Gly Phe Thr Lys Val Cys Gly Ala Pro Pro Val Cys He Gly Gly Ala 
15 10 15 

Gly Asn Asn Thr 
20 

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 




He Gly Gly Ala Gly Asn Asn Thr Leu His Cys Pro Thr Asp Cys Arg 
1 5 10 15 



Lys His Pro 



(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



._r'i (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

^ Thr Aso Cys.Phe Arg Lys His Pro Asd Ala Thr Tyr Ser Arg Cys Gly 

*U 1 " 5 10 .15 

k! Ser Gly Pro Trp 

T2) INFORMATION FOR SEQ ID NO: 87: 

O (i) SEQUENCE CHARACTERISTICS: 

Si (A) LENGTH: 20 amino acids 

Q (B) TYPE: amino acid 

fp (C) STRANDEDNESS: single 

^ (D) TOPOLOGY: linear 

^ ! (ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

Ser Arg Cys Gly Ser Gly Pro Trp He Thr Pro Arg Cys Leu Val Asp 
15 10 15 

Tyr Pro Tyr Arg 
20 

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

Cys Leu Val Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr He 



10 



15 



Asn^Tyr' Thr~ He 
20 



(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

Pro Cys Thr He Asn Tyr Thr He Phe Lys He Arg Met Tyr Val Gly 
15 10 15 

Gly Val Glu His 
20 

(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

Met Tyr Val Gly Gly Val Glu His Arg Leu Glu Ala Ala Cys Asn Trp 
15 10 15 

Thr Pro Gly Glu 
20 



(2) INFORMATION FOR SEQ ID NO: 91: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

Ala Cys Asn Trp Thr Pro Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp 
15 10 15 
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Arg Ser Glu Leu 
20 



(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



C (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

ffi Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Thr Thr Thr 
1 5 10 15 



Gin Trp Gin Val 
W 20 

h$2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

Tyr Gin Val Arg Asn Ser Thr Gly Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: YES 



. (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 
ACGTCCGTAC GTTCGAATTA ATTAATCGA 



29 





(2) INFORMATION FOR^ SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: YES 



Q (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

S!cTCCGGACG TGCACTAGCT CCCGTCTGTG GTAGTGGTGG TAGTGATTAT CAATTAATTG 60 

W 

h£2) INFORMATION FOR SEQ ID NO: 96: 

T?\ (i) SEQUENCE CHARACTERISTICS: 

^ : (A) LENGTH: 19 base pairs 

^ (B) TYPE: nucleic acid 

%J (C) STRANDEDNESS: single 

SI (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
GTTTAACCAC TGCATGATG 
(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 



GTCCCATCGA GTGCGGCTAC 
(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



* (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 
JKGTGACATGG TACATTCCGG ACACTTGGCG CACTTCATAA GCGGA 
^7(2) INFORMATION FOR SEQ ID NO: 99: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

MOLECULE TYPE: DNA (genomic) 
HYPOTHETICAL: NO 
ANTI-SENSE: NO 



SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
TGCCTCATAC ACAATGGAGC TCTGGGACGA GTCGTTCGTG AC 
(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 



p (iii) 

a ""* 

(iii) 
(xi) 



TACCCAGCAG CGGGAGCTCT GTTGCTCCCG AACGCAGGGC AC 
' ( 2 r '^INFdRHaTItifi M FOR"~SEQ Itr*NO*^ IrO l : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 2 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



™ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

C&GTCGTGGTG GGGACGGAGG CCTGCCTAGC TGCGAGCGTG GG 
%p2) INFORMATION FOR SEQ ID NO: 102: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 
CGTTATGTGG CCCGGGTAGA TTGAGCACTG GCAGTCCTGC ACCGTCTC 
(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 2 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
CAGGGCCGTT CTAGGCCTCC ACTGCATCAT CATATCCCAA GC 



(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
CCGGAATGTA CCATGTCACG AACGAC 
j(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE : NO 



! : (xi) SEQUENCE DESCRIPTION: SEQ ID NO 

GCTCCATTGT GTATGAGGCA GCGG 

(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
GAGCTCCCGC TGCTGGGTAG CGC 
(2) INFORMATION FOR SEQ ID NO: 107: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single - 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 

CCTCCGTCCC CACCACGACA ATACG 25 

(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 
J; (A) LENGTH: 27 base pairs 

^ (3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
=£! (D) TOPOLOGY: linear 

W 

n (ii) MOLECULE TYPE: DNA (genomic) 

W (iii) HYPOTHETICAL: NO 
s 

O (iii) ANTI-SENSE: NO 

%[ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 

H:tacccgggc CACATAACGG GTCACCG 27 
(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
.(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

GGAGGCCTAC AACGGCCCTG GTGG 

(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 

TTCTATCGAT TAAATAGAAT TC 22 

(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 
q (A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
W (D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



*| (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 

aCCATACGCT CACAGCCGAT CCC 23 
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